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Problem  Statement 


This  research  project  has  involved  studying  methods  for  Synthesis  and  Simulation 
of  robotic  motions.  The  emphasis  has  been  on  the  development  of  a  scientific  foundation 
for  design  synthesis  as  well  as  computer  graphics  based  high  performance  simulation  for 
multi-degree  of  freedom  motions  in  general  and  robotic  systems  and  components  in 
particular. 


Background 

Military  applications  of  robot  manipulators  are  usually  different  from  the 
commercial  manufacturing  applications  in  that  robots  are  used  in  the  unstructured  field 
operations  where  there  is  a  need  for  long  reach,  redundant  degrees  of  freedom,  high 
payloads  and  on  site  calibration  and  targeting.  Commercially  available  robots  are  usually 
designed  for  the  more  structured  manufacturing  applications  and  therefore  are  not 
suitable  for  such  field  operations.  Military  robots  are  therefore  custom  designed  for  each 
application.  This  means  that  there  is  a  need  for  a  set  of  scientific  design  and  planning 
tools  that  can  be  used  in  the  initial  design  and  the  subsequent  field  testing  of  military 
robots.  This  research  deals  with  development  of  scientific  fundamentals  that  can  provide 
the  foundation  for  the  development  of  such  tools. 


Summary  of  Results 

The  main  results  of  this  research  has  been  published  (or  is  being  published)  in  the 
technical  literature.  A  summary  of  these  publications  is  provided  in  the  next  section. 
Here,  we  provide  a  summary  of  some  of  the  main  results. 

The  work  performed  under  this  project  can  be  divided  into  two  parts:  one  dealing 
with  issues  associated  with  motion  synthesis  meaning  design  synthesis  of  robotic 
motions,  sub-components,  and  systems  and  one  dealing  with  high  performance  computer 
graphics  simulation  of  robotic  and  general  multi-degree-of-freedom  motions.  Several 
results  have  been  derived  and  development  in  each  of  these  broad  areas  that  have  not  only 
extended  the  state  of  knowledge  in  robotics  and  mechanical  system  design  but  have  also 
provided  impact  in  other  areas  such  as  computer  graphics  and  computational  geometry. 
Each  of  these  two  parts  are  described  in  separate  sub-sections  below. 

Results  in  Design  Synthesis 

In  the  area  of  design  synthesis  we  have  published  seven  technical  papers  (Papers  1- 
7  in  the  next  section).  We  have  developed  a  general  computational  geometric  structure 
for  mechanical  motion  synthesis  that  can  be  used  to  design  mechanical  linkages  (see 
publication  No.  1).  Since  spherical  linkages  can  be  used  to  model  robotic  wrists,  we  have 
applied  this  technique  to  the  design  of  spherical  linkages  representing  robotic  wrists 
mechanisms. 

In  the  case  of  redundant  manipulators,  we  have  developed  a  set  of  kinematics 
design  criterion  that  can  be  used  to  generate  singularity  free  motion  trajectories. 
Kinematic  design  arrangements  have  been  studied  for  redundant  manipulators  that  would 
produce  singularity  free  trajectories  within  the  workspace.  Redundant  manipulators  have 
extra  degrees  of  freedom  that  can  be  used  in  reaching  around  objects  or  in  avoiding 
singularities.  These  manipulators  are  specially  useful  in  Army  applications  involving, 
for  example,  camouflage  painting  of  Amy  vehicles  or  in  rapid  repair  of  Army  facilities. 
In  the  past  there  exists  very  little  knowledge  on  how  to  globally  design  such  manipulators 
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to  assure  that  singularity  free  trajectories  will  exist  for  any  set  of  path  requirements  within 
the  workspace  of  the  manipulator.  In  this  research  an  approach  based  on  Lie  algebra  is 
used  and  the  kinematic  problem  for  a  redundant  robot  is  formulated  as  a  control  problem. 
This  problem  is  then  used  to  define  a  set  of  kinematic  design  criterion  that  can  be  used  to 
check  suitability  of  a  design  for  singularity  avoidance.  The  results  are  summarized  in 
publication  No.  2. 

Another,  general  set  of  results  in  design  synthesis  has  dealt  with  motion  design  for 
instantaneous  performance.  A  powerful  tool  used  in  instantaneous  kinematic  design  is 
the  set  of  so-called  "instantaneous  invariants".  These  quantities  are  used  to  either  study 
differential  properties  of  motions  or  to  design  for  certain  differential  kinematic 
performance.  The  problem  for  applying  instantaneous  invariants  to  three  dimensional 
linkages  such  as  robot  manipulators  is  that  their  derivation  is  based  on  the  use  of  the  so- 
called  canonical  coordinate  system  which  is  difiScult  to  derive  for  spatial  linkages.  In  this 
research  we  have  developed  a  method  for  the  derivation  of  instantaneous  invariants  which 
are  independent  of  the  use  of  canonical  coordinate  system.  This  is  described  in 
publication  No.  3. 

In  terms  of  more  specialized  research  in  design  synthesis,  our  emphasis  in  this  work 
has  been  on  the  design  of  drive  systems  and  their  components  for  robotic  systems. 
Several  applications  of  robotic  for  military  field  operations  require  design  of  specialized 
drive  systems.  This  includes  for  example  design  and  development  of  specialized 
automatic  weapon  loading  systems.  Here  we  have  developed  a  set  of  scientific 
techniques  for  design  and  sizing  of  the  drive  system  of  robot  manipulators  including  the 
joint  actuators  and  ball  screw  type  drive  mechanisms. 

In  this  area,  we  have  developed  a  method  for  sizing  the  joint  actuators  of  a  robot 
manipulator.  The  problem  is  formulated  as  an  optimization  problem  where  the  actuators 
are  selected  to  achieve  a  desired  dynamic  load  carrying  capacity  of  the  manipulator  over 
an  entire  trajectory.  Worst  case  trajectories  in  the  robot  workspace  are  then  used  to 
generalize  the  results  to  joint  actuator  selection  over  the  entire  robot  workspace. 
Appropriate  constraints  are  included  in  the  problem  formulation  to  prevent  actuators  from 
being  overloaded  and  to  limit  their  sizes.  The  results  are  summarized  in  publication  No. 
4. 

In  most  robotic  systems,  electric  actuators  are  usually  coupled  with  some  form  of 
geared  transmission  mechanism.  There  has  been  some  previous  work  on  gearing  systems 
but  very  little  work  exist  in  understanding  of  ball  screw  mechanisms  which  are  also 
commonly  used  in  several  robotic  systems.  These  mechanisms  are  also  used  in  other  one 
or  two  degree  of  freedom  automation  systems  used  by  the  US  Army  such  as  the  tilt  table 
ammunition  loading  system.  We  have  developed  the  basic  kinematic  model  of  the  ball 
screw  mechanism  and  have  shown  that  much  of  the  existing  results  in  the  literature  are 
based  on  false  assumptions.  We  have  also  included  in  our  model  the  effect  of  contact 
deformation  and  have  identified  the  pattern  of  sliding  lines  of  contact  for  wear  and  finite 
element  analyses.  We  have  used  the  results  of  our  kinematic  analysis  to  develop  methods 
for  design  and  efficiency  analysis  of  such  mechanisms.  The  work  developed  under  this 
project  represents  the  first  comprehensive  and  mathematically  correct  treatment  of  the 
subject.  The  results  are  summarized  in  two  separate  publications  No.  5  and  6. 

Since  every  drive  system  for  a  robot  manipulator  has  to  use  some  sort  of  a  ball 
bearing,  this  research  has  also  looked  into  some  aspects  of  the  analysis  of  the  high  speed 
thrust  Ball  Bearings.  A  method  is  developed  for  design  of  Ball  Bearings  that  would 
results  in  optimum  contact  angle  and  minimum  friction.  The  method  is  specially  useful 
for  high  speed  applications  and  is  described  in  detail  in  publication  No.  7. 

Results  in  High  Performance  Computer  Graphics  Simulation 

In  the  area  of  simulation  of  robotic  motions,  this  research  has  made  several 
fundamental  contributions  in  both  kinematics  and  dynamics  simulation  as  well  as  in  the 
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area  of  calibration  and  referencing  for  simulation  purposes.  The  results  in  addition  to 
their  applications  in  robotics  are  considered  as  fundamental  contributions  to  the  fields  of 
Computer  Aided  Geometric  Design,  Computer  Graphics,  Kinematics  and  Dynamics. 

IN  the  area  of  kinematic  simulation  of  robotic  motions,  a  scientific  foundation  is 
developed  for  motion  inte^olation  based  on  the  use  of  a  kinematic  mapping  for  spatial 
kinematics.  Both  analytical  (publication  No.  8)  as  well  as  discrete  computational 
geometric  methods  (publication  No.  9)  are  developed  for  design  of  Bezier  type  motions 
that  can  interpolate  a  series  of  control  configurations  of  an  end-effector.  In  robotics, 
generation  of  Cartesian  trajectories  requires  interpolation  of  specified  target 
configurations  of  the  end-effector.  In  computer  simulation  of  large  mechanical  systems, 
generation  of  tightly  spaced  displacements  of  a  moving  body  along  its  trajectory  may  not 
be  cost  effective.  Instead,  a  series  of  displacements  are  generated  using  the  dynamic 
equations  of  motion  and  then  the  in-between  displacements  can  be  constructed  using 
appropriate  motion  inteprolants.  In  computer  graphics,  motion  interpolation  is  a 
fundamental  problem  in  animation.  In  this  research  we  have  development  the  first  very 
fundamental  analysis  of  the  motion  interpolation  problem  (publications  No.  8  and  9)  and 
for  the  first  time  have  developed  a  completely  coordinate  independent  method  for 
interpolation  of  rotations  (publications  No.  10  and  11).  We  have  also  been  able  to 
develop  the  first  generation  of  the  so-called  Bezier  curves  in  nonlinear  or  curved  spaces 
of  manifolds  of  rigid  body  motion  (publications  No.  10  and  11).  In  dealing  with 
simulation  of  robotic  motions,  the  kinematic  inteprolation  techniques  developed  would 
allow  smooth  animation  of  robot  motions  without  the  need  for  solving  the  differential 
equations  of  motions  at  all  display  sampling  intervals.  This  results  in  a  simulation 
capability  that  would  require  much  less  computer  power  as  the  more  traditional 
approaches. 

Any  CAD  (Computer  Aided  Design)  based  robotic  simulation  system  requires  some 
level  of  calibration  capability  before  it  can  be  utilized  for  military  field  applications.  As 
part  of  this  research,  a  method  is  developed  for  field  calibration  and  targeting  of  robot 
manipulators.  The  calibration  approach  would  allow  a  robot  to  use  targets  in  a  site  to 
reference  itself  to  its  operational  field  environment.  The  algorithm  developed  uses 
Clifford  algebra  to  exploit  the  geometry  of  the  CAD  model  and  uniquely  assigns  a  body 
fixed  coordinate  system  to  the  computer  model  of  the  object.  This  is  important  in 
World  Model  calibration  for  off-line  robot  programming  and  simulation  (see 
publication  No.  12)  where  the  computer  model  has  to  update  its  object  models 
automatically  from  sensor  measurements.  The  results,  in  addition  to  their  robotic 
applications,  can  also  be  useful  in  artillery  aiming  and  target  acquisition  problems  in 
smart  Weapon  systems.  In  addition,  we  have  studied  the  mechanics  of  automatic 
targeting  using  different  sensor  arrangements  and  have  developed  a  method  for 
reconstructing  object  or  target  locations  based  on  redundant  sensor  measurements  of 
different  geometric  features  (see  publication  No.  13).  Although  the  problem  lends  itself 
to  a  non-linear  least  squares  problem,  we  have  been  able  to  develop  a  formulation  that 
reduces  the  problem  into  a  linear  problem  suitable  for  fast  computations.  The  results,  in 
addition  to  their  application  in  calibration  of  robotic  and  mechanical  system  simulation 
software  packages  have  applications  in  automatic  targeting  for  smart  weapon  systems. 

The  method,  in  a  specialized  form,  has  been  tested  by  a  specialized  medical 
instrument  manufacturer  (Accuray  in  Silicon  Valley,  California)  for  a  robotic  system 
for  Stereolaxic  Radiosurgery.  The  results  have  been  promising  and  has  the  potential  in 
improving  the  accuracy  of  their  system  and  reducing  the  calibration  requirements  for 
their  device. 

A1 1  of  the  work  described  so  far  in  this  section  has  dealt  with  kinematic  simulation. 
In  the  area  of  dynamic  simulation,  our  activities  have  focused  on  development  of 
methods  for  the  formulation  of  dynamical  equations  of  motion  for  computational 
purposes  and  dynamic  interpolation  of  motion  trajectories.  In  terms  of  formulation  of 
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equations  of  motion  for  dynamical  systems,  we  have  developed  a  method  that  would 
result  in  first  order  decoupled  equations  of  motion  for  multi  body  mechanical  systems. 
Such  equations  can  then  be  more  easily  integrated  resulting  in  the  computational 
performance  needed  for  high  performance  computer  graphic  simulation.  The  results 
developed  apply  to  not  only  robotic  systems  but  also  to  the  more  general  class  of 
mechanical  systems  with  multiple  degrees  of  freedom,  and  made  of  multiple  bodies. 
The  decoupling  method  involves  an  algorithm  for  selecting  the  generalized  coordinates 
used  to  define  the  configuration  of  a  dynamical  system.  This  choice  of  the  so-called 
generalized  coordinates  guarantee  that  the  resulting  dynamical  equations  of  motion 
become  decoupled  in  the  highest  derivative  terms.  The  algorithm  uses  congruency 
transformation  and  constraint  relaxation  to  achieve  the  first  order  decoupling  of  the 
resulting  equations  of  motion.  In  the  case  of  the  more  complex  dynamical  systems 
where  the  appropriate  congruency  transformation  may  be  difficult  to  obtain,  a  method  is 
developed  based  on  the  use  of  orthogonal  complements  that  still  achieves  the  needed 
computational  efficiency.  The  results  have  been  used  to  simulate  the  dynamic 
equations  of  motion  for  a  robotic  system  on  a  moving  base.  This  is  the  kind  of  robotic 
system  useful  in  Army  field  applications  where  a  robot  for  ammunition  loading  for 
example  can  be  mounted  on  the  back  of  an  Army  truck.  The  work  is  published  in 
publication  No.  14. 

In  the  area  of  dynamic  interpolation,  a  method  is  developed  for  generation  of 
dynamic  equations  of  motion  that  interpolates  two  end  positions  of  an  end-effector. 
The  problem  is  formulated  as  a  two  pointy  boundary  value  problem  and  is  solved  using 
a  numerical  method.  The  results  are  summarized  in  publication  No.  15  and  have 
applications  again  in  robotic  system  simulation  and  computer  graphics  animation  of 
rigid  body  motion. 
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Preface 


This  volume  contains  a  selection  of  papers  presented  at  the  13th  World  Congress  on 
Computation  and  Applied  Mathematics.  This  Congress  was  organized  by  IMACS  in  Dublin, 
Ireland  on  July  22-26,  1991.  The  best  papers  in  the  areas  of  artificial  intelligence,  expert 
systems,  and  symbolic  computing  were  selected  along  with  applications  to  scientific  com¬ 
puting.  About  one-third  of  the  papers  presented  in  these  areas  at  the  World  Congress  were 
selected. 

These  48  papers  provide  an  excellent  overview  of  the  dynamic  state  of  these  closely  related 
fields.  We  foresee  that  scientific  computation  will  involve  symbolic  and  artificial  intelligence 
tools  more  and  more  as  these  software  systems  become  more  and  more  sophisticated.  The 
future  systems  of  computational  science  and  engineering  will  be  problem  solving  environ¬ 
ments  created  with  components  from  numerical  analysis,  computational  geometry,  symbolic 
computing,  and  artificial  intelligence.  The  historical  separation. of  these  fields  will  gradually 
blur  as  they  come  together  to  create  the  high  level,  natural  systems  of  the  future. 

We  thank  Georgia  Connaroe  and  Connie  Wilson  for  their  excellent  help  in  organizing 
the  correspondence  for  the  papers  in  this  book.  We  also  thank  Robert  Vichnevetsky  for  his 
encouragement  to  us  in  editing  this  volume. 

Elias  N.  Houstis  and  John  R.  Rice 
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lem  into  the  problem  of  geometric  curve  design  problem.  This  is  made  possible  using  a 
geometric  mapping  introduced  by  Ravani  and  Roth  (1984).  The  mapping  transforms  a 
mech^cal  motion  in  the  Euclidean  space  into  a  curve  in  a  non-EucUdean  space  referred 
to  as  the  linage  space  of  the  mapping.  Ravani  and  Roth  used  the  mapping  and  solved 

In  oS^tice  "design  problem  by  a  curve  fitting  procedure. 

pr^tice,  the  number  of  specified  points  for  the  curve  fitting  procedure  is  usuaUy  smaU 
not  allowing  global  control  on  the  entire  motion  of  the  designed  linkage.  In  this  paper 

developed  for  synthesis  of  analytic  motions  is  used  to 
Sr  points  for  the  curve  fitting  procedure.  Research  is  presently 

T^Z7  S  procedure.  This  is  not  presently  tractable 

rSi  S  described  by  piecewise  parametric  rep¬ 
resentations  used  in  Computer  Aided  Geometric  Design  (CAGD).  ^ 

In  this  paper  the  kinematic  mapping  introduced  by  Ravani  and  Roth  (1984)  is  made 
Sn  2  f  computational  geometric  algorithms.  Using  this 

odr«r  ^ f  "^^PP'OS  and  generalizing  techniques  from  the  field  of  CAGD,  meth- 

imlZ,  r  parametric  curves  that  are  approx¬ 

imations  of  analytic  motions.  In  this  manner,  controlling  the  shape  of  a  curve  iws 

mSScAn'^f  dimensional  synthesis  for  a 

linkage,  the  problem  is  considered  in  two  stages.  The  first  stage  uses  the 

SorT  dnL‘r  specifications  for  a  desired  motion  to  have 

more  globd  control  on  the  design  of  the  linkage.  The  second  stage  uses  a  curve  fittinff 
procedure  to  determine  dimensional  parameters  of  the  desired  lin4e.  The  approach  prc? 

^  geometric  method  for  mechanical  design  synthesis  with  global 

control  over  the  entire  range  of  the  motion  of  the  linkage.  ^ 

P^P®!'  i®  only  to  introduce  the  idea  and  develop  some  of  the 

orobirm  r^T'  ^  complete  solution  of  the 

1  only  Hermite  interpolation  is  discussed.  In  relation  to 

f  of  niechanical  linkages,  only  a  simple  example  involving  dimensional 

synthesis  of  special  spherical  linkage  is  presented. 

the  paper  is  as  follows.  We  first  provide  a  brief  description  of  the 
or  ented  image  space  of  kinematic  mapping.  This  provides  for  a  geometric  representation 
displacements  and  motions  m  terms  of  points  and  curve  segments,  respectively  in 

moti^s^Vr^’  ^  nerimte  interpolation  for  spherical 

motions.  This  can  be  used  to  synthesize  analytic  spherical  motions.  The  approach  is 

then  combined  with  a  curve  fitting  procedure  for  design  of  spherical  linkages.  The  results 
are  only  presented  for  a  special  spherical  four  bar  linkage. 


2  THE  ORIENTED  MAPPING 

Euclidean  three-space  is  a  rotation  about  and  a  translation 

from  nnp^"  V  f  ‘^e  screw  axis.  A  general  displacement  of  an  object 

ancT-b^warJjT  oppositely  oriented  (namely  “forwid” 

“es  Th^ITwa  d ''Tl,  T  oppositely  oriented  screw 

axes.  The  forward  and  backward  screw  displacements  define  two  sets  of  oppositely  signed 
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dual  Euler  parameters  X  =  {Xi,X2,Xz^X^)  and  —X 
amd  Ravani  1991)  where 

X,  =  Sism(d/2), 

X2  =  52  sm(5/2), 

Xz  =  53sm(5/2), 

X,  =  cos(0/2). 

The  symbol  denotes  numbers  of  the  form  d  =  a  4-  ea°,  called  dual  numbers y  where 
e  has  the  property  =  0.  The  dual  numbers  Si  =  Si  +  es°  (z  =  1,2,3)  are  Pliicker  line 
coordinates  representing  the  screw  axis;  the  dual  number  ^  ^  +  ed  is  the  dual  angle 

representing  the  amount  of  a  screw  displacement.  The  sine  and  cosine  functions  of  a  dual 
angle  are  defined  as 

sin(0/2)  =  sin(^?/2)  4-  e{d/2)  cos(^/2), 
cos(0/2)  =  cos(^/2)  -  e(d/2)  sin(0/2). 

More  details  on  dual  numbers,  Pliicker  coordinates  and  dual  Euler  parameters  can  be 
found  in  Bottema  and  Roth  (1979). 

The  dual  Euler  parameters  (1)  can  also  be  written  in  the  form  of  Xi  =  Xi  4-  eXf, 
i  =  1,2,3, 4,  where 

Xi  =  Sisin(^/2), 

X2  =  S2sin(i9/2), 

X3  =  S3sin(5/2), 

X4  =  cos(0/2) 

and 

X°  =  Si  (d/2)  cos(^/2)  4-  Sj  sin(^/2), 

X2  =  S2(d/2)  cos(^/2)  4-  S2  sin(^/2), 

Xl  =  Sz[d/2)  cos(^/2)  +  st  sm{e/2)y 
XO  =  -(d/2)sin(0/2). 

The  parameters  X  =  (Xi,X2,X3,X4)  are  the  Euler  parameters  of  a  rotation  and  the 
parameters  X^  =  (X^jXjjXjjXj)  represents  a  translation  after  the  rotation  X. 

Study  (1891)  used  the  dual  Euler  parameters  in  the  form  of  (X,X°)  and  developed  a 
geometric  representation  for  a  spatial  displacement  in  terms  of  Study  vectors  which  repre¬ 
sent  oriented  lines  in  a  space  of  four  dimensions  called  the  Soma  space,  A  recent  account 
of  the  Study  vectors  can  be  found  in  Bottema  and  Roth  (1979).  Ravani  sund  Roth  (1984) 
used  the  dual  Euler  parameters  in  the  form  of  (1)  as  a  set  of  four  homogeneous  coordi¬ 
nates  to  define  a  geometric  mapping  of  displacements  into  points  of  a  three  dimensional 
projective  space  with  a  dual  metric  called  the  image  space.  Thus  a  one-degree-of-freedom 
mechanical  motion  is  mapped  into  an  image  space  curve;  and  a  two-degree-of-freedom 
motion  is  mapped  into  a  two  dimensional  surface  in  the  image  space.  They  then  solved 
the  linkage  dimensional  synthesis  problem  by  a  curve-fitting  procedure  for  an  algebraic 
curve. 

Ge  and  Ravani  (1991)  made  the  geometric  mapping  of  Ravani  and  Roth  orientable 
by  considering  the  signed  dual  Euler  parameters  as  the  signed  homogeneous  coordinates 
of  an  image  point.  Instead  of  identifying  the  points  X  and  mX  (where  w  ^  w  evP  is 


=  (^Xi,-X2,-X3,-X4)  (see  Ge 

(1) 
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a  nonpure  dual  number)  completely,  Ge  and  Ravani  distinguished  them  according  to  the 
sign  of  w.  They  are  considered  to  be  identical  if  u;  >  0;  and  they  are  two  antipodal  points 
if  ly  <  0.  Two  antipodal  points  are  two  distinct  points  occupying  the  same  position  in  the 
image  space  but  with  “opposite  orientations”,  namely  “forward”  and  “backward”.  In  this 
way  a  general  displacement,  which  can  be  performed  either  by  a  forward  or  a  backward 
screw  displacement,  corresponds  not  to  one  image  point  but  two  antipodal  points.  The 
image  space  consists  of  such  antipodal  points  is  called  the  oriented  image  space.  It  is  a 
double  covering  of  the  original  image  space. 

The  introduction  of  orientation  into  the  image  space  makes  it  possible  to  unambigu¬ 
ously  define  the  distance  between^twojmage  points,  and  the^ directed  line  joining  from  one 
point  to  another.  Let  X  =  (Xi,X2,X3> X4)  and  Y  =  (Yi, >2,  Y3,  V^)  denote  two  oriented 
points.  The  distance  between  them  is  the  dual  angle  d  =  a  +  e/  (where  0  <  a  <  tt) 
obtained  from: 

X- Y 

(X*X)V2(Y- Y)i/2’  w 

where  the  inner  product,  X  •  Y  =  XiYi  +  X2 Y2  +  -V3Y3  +  is  a  signed  dual  number. 

The  norm  of  X  is  \/x  •  X  and  X\s  said  to  have  normalized  coordinates  when  X  ♦  X  =  1. 

The  line  joining  from  X  to  Y  is  given  by  the  wedge  product  X  A  Y.  The  symbol 
“A”  denotes  the  vector  wedge  product  which  generalizes  to  higher  dimensions  the  vector 
cross  product,  see  Flanders  (1963).  The  components  of  X  A  Y  are  the  signed  Plucker 
coordinates  of  a  line.^  Here  the  sign  is  significant  because  —X  A  Y  —  Y  A  X  represents 
a  line  joining  from  Y  to  X.  They  define  two  coincident  but  oppositely  oriented  lines. 
Similarly,  the  orientations  of  a  plane  defined  by  three  oriented  points  and  a  tetrahedron 
defined  by  four  points  can  also  be  unambiguously  defined. 

3  INTERPOLATING  ROTATIONS 

Recent  works  on  rotation  interpolation  that  are  related  to  our  approach  are  Shoemake 
(1985,  1987),  Duff  (1986),  and  Pletinckx  (1989).  They  developed  subdivision  methods  for 
interpolating  rotations  using  unit  quaternions.  Ge  and  Ravani  (1991)  built  on  their  works 
and  developed  analytical  methods  for  interpolating  general  displacements  (including  both 
rotations  and  translations)  with  higher  order  continuities.  This  section  applies  the  re¬ 
sults  in  Ge  and  Ravani  (1991)  to  Hermite  interpolation  of  rotations  which  correspond  to 
spherical  motions. 

Given  two  distinct  orientations  of  an  object  in  space  together  with  two  instantaneous 
rotation  axes  at  these  orientations,  a  cubic  parametric  motion  can  be  designed  using 
Hermite  interpolation.  Essential  to  Hermite  interpolation  is  the  ability  to  define  the 
“tangent  direction”  of  an  orientation.  In  this  paper,  we  define  the  tangent  direction  using 
the  notion  of  absolute  polarity  since  the  image  space  is  a  projective  three  space  with  an 
elliptic  metric.  Let  an  orientation  be  represented  by  an  image  point  X  =  (Xi, X2,  X3,  X4) 
where  Xi  (i  =  1 , 2, 3, 4)  are  given  by  (2).  Its  tangent  direction  is  then  defined  by  a  point  X' 
on  the  polar  plane  of  X  with  respect  to  the  absolution  =  X-X  =  0,  that  is  X'  is  defined 
such  that  X'  *  X  =  0.  Let  p  =  ijpi.p^^Pz)  denote  the  coordinates  of  the  instantaneous 
axis  at  the  orientation  X.  Then  the  corresponding  polar  point  X'  =  {X[f  X^^  X'^)  is 
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given  by 


rx;i 

0  -P3  P2  Pi  ■ 

rX:  -] 

P3  0  -Pi  P2 

1 

0 

X3 

.  -PI  -Vi  -Pz  0  J 

1X4  J 

The  skewsymmetry  of  the  matrix  in  (4)  results  in  X'  •  X  =  0  for  an  arbitrary  p.  Readers 
are  referred  to  Coxeter  (1961)  for  more  explanations  on  the  absolute  polarity  of  the  elliptic 
geometry. 

We  now  turn  to  the  Hermite  interpolation  of  orientations.  Let  Xi  and.X2  denote 
the  coordinates  of  the  image  points  for  two  given  orientations,  respectively.  In  order  for 
the  interpolation  to  be  unique,  these  coordinates  are  required  to  satisfy  Xj  •  Xi  =  1, 
X2  '  X2  =  1,  and  Xi  •  X2  >  0.  Geometrically,  this  means  that  the  coordinates  are 
normalized  and  the  angular  distance  between  Xi  and  X2  is  less  than  7r/2.  Let  pi  and 
P2  denote,  respectively,  the  coordinates  of  two  given  instantaneous  axes.  Then  the  two 
corresponding  points  Xj  and  X2  polar  to  Xi  and  X2,  respectively,  with  respect  to  =  0, 
are  can  be  obtained  using  (4).  The  image  curve  for  the  cubic  motion  that  starts  from 
the  orientation  Xi  with  the  initial  instantaneous  axis  Pi  and  ends  at  the  orientation  X2 
with  the  final  instantaneous  axis  p2  is  obtained  by  the  following  Hermite  interpolation 


P{t)  -  [t^  t  l][M,] 


wiXi  ■ 
u;;X'i 

W2X2 


where  [Mf^]  is  the  Hermite  basis  matrbc: 


^  =  [0,1] 


1 

-2 

1 

0 


1 

-1 

0 

0 


The  Hermite  cubic  image  curve  in  normalized  coordinates  is  given  by 


X{t)  =  F{t)/w{t)  (5) 

where  u;(i)  is  the  norm  of  P(t).  The  weighting  factors  ly,-,  (i  =  1, 2)  provides  additional 
adjustments  to  the  shape  of  the  cubic  curve. 

In  this  manner  an  analytic  spherical  motion  can  be  designed  using  Hermite  interpola¬ 
tion  based  on  specifications  of  a  set  of  discrete  orientations  together  with  an  instantaneous 
axis  at  each  orientation.  The  resulting  piecewise  Hermite  interpolation  is  automatically 
continuous.  Using  the  differential  geometry  of  the  image  space  (McCarthy  and  Ravani 
1986),  the  instantaneous  poles  can  be  chosen  such  that  the  interpolated  motion  possess 
higher  order  continuities  (Ge  and  Ravani  1991). 

This  approach  is  not  restricted  to  Hermite  interpolation.  Other  forms  of  interpolation 
can  also  be  utilized.  For  example,  rather  than  specifying  two  displacements  together  with 
two  instantaneous  poles,  four  displacements  can  be  specified  and  interpolated  using  B&ier 
interpolation.  Ge  and  Ravani  (1991)  developed  a  general  mathematical  framework  for 
designing  parametric  motions  by  generalizing  techniques  in  the  field  of  CAGD. 
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Figure  1:  A  spherical  kite  linkage 


4  SPHERICAL  KITE  MOTION 

The  spherical  kite  linkage  is  a  spherical  four  bar  mechanism  with  two  pairs  of  neighboring 
links  having  the  same  angular  length,  see  Figure  1.  Let  the  angular  length  of  each  link  be 
denoted  by  7  for  the  ground  link,  77  for  the  coupler  and  pi,  P2  for  the  two  cranks.  In  this 
paper  it  is  assumed  that  7  =  p,  and  77  =  pi-  The  kite  linkage  has  two  folding  positions, 
the  left  and  the  right. 

It  is  known  in  the  theory  of  kinematic  mapping  that  the  image  curve  for  the  coupler 
motion  of  a  spherical  kite  linkage  consists  of  aline  and  a  cubic  curve,  see  Ge  and  McCarthy 
(1991).  They  correspond  to  two  “modes”  of  the  coupler  motion;  the  line  represents  the 
continuous  rotation  of  the  coupler  about  a  faed  pivot;  and  the  cubic  represents  the  cubic 
rotational  motion  of  the  coupler.  The  line  and  the  cubic  intersect  at  two  points  which 
correspond  to  the  left  and  right  folding  positions. 

To  derive  the  constraint  equations  needed  for  the  design  of  a  kite  linkage,  we  consider 
the  coupler  77  to  be  the  link  common  to  the  two  two-link  open  chains  on  the  left  and  on 
the  right.  We  assume  the  fixed  and  moving  pivots  of  the  linkage  to  lie  on  a  umt  sphere 
and  we  attach  a  moving  frame  M  to  the  coupler  and  a  fixed  frame  F  to  the  ground. 
•Let  the  unit  vectors  u^  =  {ui,Vi,Wi)  (i  =  1,2)  define  the  positions  of  two  fixed  pivots 
in  F,  and  mi  =  {m,,ni,U)  (i  =  1,2)  define  the  positions  of  two  moving  pivots  m  M. 
Let  X  =  {XiyX2iXzjX4,)  denote  the  coordinates  of  the  image  point  for  a  rotation  of  the 
coupler  link.  Then  the  coupler  as  the  link  common  to  the  two  open  chains  is  constrained 
by  the  following  two  equations  quadratic  in  X: 

Q,  :  X^[Qi]X  =  0,  1  =  1,2 


(6) 
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where  the  coefficient  matrices  [Qi]  (i  -  1,2)  are  given  by 


r  lUiUi  —  riiVi  —  liWi  —  cospi 


niUi-\r7niVi 


liXii+rriiWi 
riiWi  —  liVi 
liUi+rriiWi 
l^Vi+UiVJi 

-^iUi  TLiVi  +  UWi  —  COS  pi 
TTliVi-niUi 


-^niiUi'^niVi  —  liWi  —  cos  pi 
liVi^riiWi 
liUi-rriiWi 
UiWi-liVi 
kui-rriiWi 

rruVi—^iV^ 

miiLi‘\-niVi-{‘liWi  —  cos  pi , 


The  set  of  image  points  X  satisfying  (6)  are  said  to  define  a  quadric 

space,  see  Ravani  and  Roth  (1984).  The  quadric  surface  can  S  sht  of 

a  right  circular  real  hyperboloid  of  one  sheet  (Ge  and  McCarthy  1991). 

the  hyperboloid  is  determined  by  the  crank  length,  p.;  the  position  of  the  hyperboloid  is 

j  fprmined  by  the  locations  of  the  fixed  and  moving  pivots  u<  and  nm-  _ 

The  intersection  of  the  two  hyperboloids  defined  by  equations  (6)  is  the^age  curv 

This  is  due  to  the  two  special  conditions  among  its  link  dimensions:  7 

These  conditions  result  in  the  following  relationships  among  the  design  variables. 

COS  pi  =  ^1^2  ^1*^2  "b  WiVJ2i  (8) 

COS  p2  =  TUim^  +  niTi2  +  hk- 

Therefore  the  shape  and  location  of  the  two  hyperboloids  and  thus  their  intersection 
curve  is  determined  by  eight  independent  variables  from  the  unit  vectors  Ui,  uj,  1 
The  sobstiwtien  ot  (8)  into  (7)  resells  m  the 

eurve,  tooled  as  K(Xi  u„  The  problem  °f 

linkage  is  then  reduced  to  that  of  selecting  the  unit  “i  <' -  1'^) 

KiX;  u„  U2,  mi ,  m^)  approximates  the  image  curve  of  a  desired  motion. 

5  DESIGN  SYNTHESIS  OF  A  KITE  LINKAGE 

TVaditionally  a  desired  motion  is  specified  by  a  set  of  discrete  posRions  so  that  the 
I^tual  c^Sr  mori^^  may  best  guide  through  them.  This  specification,  however  doe 
^ot  havTmuch  global  control  over  the  coupler  motion.  In  this  section,  we  ap^y  the 
HermiTe  inTrpoL^^  method  for  design  of  an  analytic  cubic  motiom  the^^ 

•  analytic  motion  as  a  desired  motion  and  use  a  curve  fitting  procedure  to  design  ‘he  l^ge 
parameters.  We  have  chosen  the  kite  spherical  four-bar  linkage  since  its  image  curve 
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a  cubic  curve  and  the  Hermite  interpolation  method  that  we  have  developed  is  also  for 
cubic  curves.  In  theory,  the  results  can  be  generalized  to  any  linkage,  for  example,  by 
using  Bezier  ,  B-spline  or  Cardinal  spline  approximation/interpolation. 

We  start  the  specification  of  a  desired  motion  by  a  set  of  image  points  representing 
discrete  positions.  We  then  design  a  cubic  curve  of  desired  shape  by  piecewise  Hermite 
interpolation  of  these  points.  The  resulting  polar  points  at  each  given  point  is  then  used 
as  additional  design  specifications  for  the  desired  coupler  motion. 

Lgt  y.  2,  ■  •  • ,  n)  denote  a  set  of  image  points  representing  desired  onentations, 

and  let  Y|  denote  the  polar  points  representing  the  desired  tangent  directions.  Then  the 
problem  of  designing  a  kite  linkage  is  to  choose  the  unit  vectors  ui,  uj,  mi,  and  nuj  suA 
that  at  least  one  measure  of  the  approximation  error  of  Y;  and  YJ  from  the  twisted  cubic 
i'r(X;ui,U2,  mi, m2)  is  minimized. 

Ravani  and  Roth  (1983)  defined  the  approximation  error  at  a  desired  image  point  as 
the  normal  deviation  of  the  point  from  the  actual  image  curve  and  developed  a  normal 
curve-fitting  technique  for  motion  approximation.  In  what  follows  we  modify  the  approxi¬ 
mation  error  defined  by  Ravani  and  Roth  to  include  the  prescribed  tangent  requirements. 

Let  Xi  X-  denote,  respectively,  a  point  of  the  twisted  cubic  K  and  its  polar  pomt 
with  respect  to  K.  Let  AXi  =  Xi  -  Yi  and  AX;  =  X'  -  Y'  denote  the  deviation  vectors 
where  Yi  and  Yi'  are  the  desired  image  point  and  its  polar  point.  The  normal  deviation 
defined  by  Ravani  and  Roth  (1983)  satisfy  the  following  linear  equation: 

[A]AXi  =  qi  (9) 


where 


[Al  = 


2YnQii  1 
2YUQ2] 

2YT 


and 


q,  =  {-YnQi]Yi  -Yf[Q2lYiOF. 


Elq.(9)  is  a  linear  approximation  of  K  near  the  desired  image  point  Yj. 

The  polar  point  X(  of  Xi  with  respect  to  the  cubic  K  is  defined  by  the  four  dimensional 


unit  vector: 


-(IQilXiAlWAX,) 
^  "  l-([QilXiA[C?2lXiAXi)r 


(10) 


The  symbol  “*”  denotes  the  star  operator  for  multivectors  which  transforms,  in  this  case, 
a  multivector  of  rank  3  into  a  vector,  see  Flanders  (1963).  The  symbol  1  •  |  denote  the 
Euclidean  norm  of  a  vector.  By  substituting  X<  =  Y(  -b  AX(  and  Xi  =  Yi  +  into 
(10),  we  expand  (10)  about  the  polar  point  Y-  at  Yj.  This  yields,  after  discarding  higher 
order  terms  involving  AXi  atnd  AX^, 


‘  (IQ  ilYiA[Q2lYiAAXi)  +•  (YiA[Qi]  YiA[Q2]  AXi)  -b*  ([(32]YiAYi  a[(3i]  AXi)  -b  CAX(  =  pi. 


where 


Pi  =  CY;--  i[Qi]YiA[Q2]YiAYi) 
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and 

C  =\‘{{Qi]YiA{Q^]YiAYi)\. 

Eq.(ll)  which  approximales  the  specified  polar  point  can  also  be  rewritten  as 

[BJAXi  +  CAX'  =  Pi.  (12) 

Eq.(9)  together  with  (12)  is  a  system  of  seven  linear  equations  with  eight  imknowns. 
The  minimum  norm  solution,  (AX*,  AX'*),  of  this  system  is  the  one  that  minimizes  the 
following  Lagrangian  function: 

L  =  AX'f  AXi  +  AX'f  AX'i  +  Ar([/llAXi  -  qi)  +  A^^BjAXi  +  CAX'  -  Pi), 

where  Ai  and  Aj  are  vectors  of  Lagrange’s  multipliers.  The  approximation  error  function 
at  each  point  Yi  that  includes  both  orientation  and  tangent  requirements  is  thus  defined 

e;  =  AX*^AX*  +  AX'.'^'AX'*. 

The  total  error  is  obtained  by  summing  over  all  points  Y;  (i  =  1,2,  •  •  ■  ,n): 

E  =  f^(AX*'"AX*  +  AX'f  AX'*). 

i=l 

Standard  routines  for  nonlinear  least  squares  optimization  can  then  be  applied  to  obtain 
the  design  parameters  u;,  mi  (i  =  1,2). 

CONCLUSIONS 

This  paper  demonstrated  that  methods  from  continuous  computational  geometry  can  be 
used  with  the  aid  of  an  orientable  kinematic  mapping  for  mechanical  motion  synthesis. 
The  initial  ideas  presented  form  the  basis  for  more  research  that  would  bring  together 
the  fields  of  computational  geometry  and  kinematic  design. 
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A  Kinematic  Design  Criterion  for 
Singularity-Avoidance  in  Redundant  Manipulators 
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Abstract 

In  this  paper,  we  study  a  set  of  conditions  that  de~ 
fines  a  kinematic  desi^  criterion  for  a  redundant  robotic 
manipulaibr  in  order  that  it  generate  nngularity-fiee 
trajectories.  The  design  criterion  is  enforced  on  a  Malar 
fimction  on  the  joint  space  that  is  positive  everywhere 
.except  at  ningulAr  configurations  where  it  vanishes,  and 
a  normal  vMtor  fimction'  that  hfts  the  paths  away  from 
the  iringiilu  confignratk^.  A  theorem  is  stated  specify' 
ing'the  derign  mteriou' on  the  fimcti<»^  and  is  proved  for 
a  g<m«ral  m  degree  of  freedom'  robot  in  an  n  dimensional 
space.  The  conditions  are  global  in  the  sense  that  when 
^ese  conditions  are  met  ibr  a  given  robot,  any  path  in 
its  workspace  can  be  generated  avoiding  singulariti^.  An 
example  is  presented  that  demonstrates  how  to  identify 
the  functions  to  evaluate  a  design. 

Keywords:  Redundant  Robot  Manipulators,  Design  of 
Robots,  Singularities,  Singularity-Free  Control,  Kine¬ 
matic  Control. 

1  Introduction 

The  control  of  redundant  manipulators  has  become  a  popular 
topic  of  research  in  the  fidd  of  robotics.  Redundant  robot  mar 
nipulators  have  been  consid^ed  the  solutions,  to  the  problems 
of  trajectory  planning  which  is  often  plagued  by  singularities  in 
case  of  noDxedundant  robots.  The  idea  is  that  the  available  ex¬ 
tra  degrees  of  freedom  can  be  used  to  avoid  singularities.  In  this 
paper,  we  devdop  a  km^atic  criterion  to  evaluate  the  dedgn 
of  a  robot  in  a  i^obal  sense  in  terms  of  its  ability  to  generate 
angularity-free  paths.  -  ^  ^  •  -  '  -  ' 

:  ..  Past  research  in  robotics  has  focussed  in*  devdoping 'algo¬ 
rithms'  to  avmd  angdlarities';for  leduidant  robots:  Whitney 
(1969)  firist  pro£^8ed  reviving  redundandes  in  robots’ usmg 
pseudoinverse  techniques  which  were  extended  by  Many  re^ 
searches  rach  as  Kldn  and  Huang  (19^)rKldn  (1984)VHbIler- 
bach  and  Suh  (1985).  In  particular,  Yoshilmwa  (1984,' 85)  de¬ 
fines  a  ManipulabUity  index*  based  on  the  Jacobian  of  the  robot. 
'This  index  vanishes  at  the  sm^arities  and  assumes  large  values 
away  from  them.  By  maxin^ng  the  mainpul^Uity  index  at 
every  point  of  the  trajectory,' his  algotrithm  keeps  the  robot  as 
far  as  possible  from  the  singular  configurations.  Long  and  Paul 
(1992)  devdop  an  algorithm  that  treats  the  redundant  robot 
as  a  nonredundant  robot  by  locking  the  extra  joints.  When 
the  equivalent  nonredundant  robot  reaches  its  singular  config¬ 
urations,  the  extra  degrees-of-freedom  are  used  to  move  away 
from  the  singularities  in  an  algorithmic  fashion.  In  both  these 
cases,  the  singularity' avoidance  is  guaranteed  locally  if  there 
erists  such  a  path.  However,  there  is  no  a  prion  information 
on  the  existence  or  noneristence  of  such  paths,  nor  there  is  any 
singularity-avoidance  in  a  global  sense.  In  this,  paper,  we  focus 
our  attention  to  the  conditions  that  identify  robot  designs  that 
can  always  provide  singularity-free  movements.  We  show  that  in 
our  general  and  ^obal  treatment  of  the  problem,  we  encompass 
Yoshikawa’s  result  as  wdl. 


A  derigner’s  ^proadi  to  avoiding  angularities  is  to  design 
a  singuiarity-free  robot.  There  have  been  many  attempts  in  the 
past  in  designing  robots  with  minimum  number  of  singularities. 
Soon  it  became,  obvious  that  angularity-free  workspaces  are 
nonexistant  in  nonredundant.  as  well  as  redundant  robots  (which 
is  proved  gang  topolo^c^  arguments  by  Gottlieb  1986,  and  fid- 
lowed  by  Wampler  1987, 89,  and  Baker  wd  Wampler  1987,  ^). 
However,  one  can  expect  that  certain  derigns  *have  the  c^a- 
bility  of  .  avoiding  angularities  for  any  path  in  the  workspace. 
This  paper  presents  conditions  that  would  allow  testing  a  kirie- 
niatic  design /to  .seeif  it.  has  the  usability  to  produce,  globally, 
angularify-free  trajectories;  ;:.;:. .  ,, 

In  what  follows;  we  fir^  formulate  ,  the.  problem  of  inverse 
kinematics' of  a.  general  m  degree  of  freedom  robot  as  a  contrd 
problem  in  the  velodty  domain.  Then  we  present  the  functions 
that  play  the  key  role  in  our  result  followed  by  some  examples 
on  how  to  choose  theise  functions.  A  theorem  is  then  stated 
and  proved  that  enumerates  the  conditions  these  functions  have 
to  meet  that  guarantees  the  existence  of  nonsingular  paths.  A 
second  theorem  follows  as  a  modification  of  the  first  theorem  to 
ease  the  conaputatiohal  aspects.  Finally,  we  consider  the  exam¬ 
ple  of  a  four  jointed  spherical  wrist  (considered  by  Yoshikawa 
1984,  85.  arid  Long  arid  Paid  1992,  Long  et  ^  1992)  and  evalute 
its  deagri  uang  our  Icinematic  design  criterion. 

2  The  Inverse  Kinematics  i;Fro.blem 

A  robot  manipulator  conaats  of  several  rigid  bodies  connected 
by  various  Jypes  of  joints.  Several  representations  have  been 
developed  in  jtiie  robotics  literature  ,to  represent,  the  Inter- 
connec^ons  of  these  ri^d  bodies,  and^  the  rdl^ionship  of  the 
end-T^ector  to  the: base  of  .^e  robot.  One  of  seemed  to 
have  emerged.,  as.  .more  popMax'^  as  geometric,  4  x  4  ma¬ 

trices'  uang  l^.J>enayit-Har^  p^ame1^.s..  .We  use  this 
i^resent^cm  wh^  follows,  however,,  the  theory  is  y^  gen- 
eral.andindependent  of  the  nepre^tation  sch^e.  .. 

Let  .us  conader  a  robot  with  m  degrees  of  freedom.  Each 
degree  of  fireedom  denotes  the  capability  of  one  degree  of  free¬ 
dom,  jomt.  In  g^eral,  each  joint  can  either  be  revolute  or  pris¬ 
matic.  If  thMe  are  r  revdute  joints  and  p  prismatic  joints 
where  m  =  p  +  r,  and  the  joint  variables  are  denoted  by 
q,',t  =  l,2,...,m,  then  the  4  x  4  transformation  matrix  that 
defines  the  location  and  orientation  of  the  end-efiector  to  the 
frame  is  given  by  the  kinematic  equation: 

T  =  Tt(«i)r,fe)-T„(^)  (1) 

where  T;  is  a  4  X  4  transformation  matrix  describing  the  kine¬ 
matic  relationship  between  the  tth  link  and  its  next  one,  the 
(i  +  l)th.  Its  actual  form  depends  on  the  geometry  of  the  robot, 
and  is  given  by 

'  cos  (9,-  -  sin  cos  a;  sin  0,- sin  OiCosOi 
^  _  an  6i  cos  6i  cos  a,-  ~  cos  6{  sin  ct,-  o;  sin  6i 

*  ”  0  smcq  cosa,-  d,  ’  ^  ^ 

0  0  0  1 


Figure  1:  Denavit-Hartenberg  parameters  associated  with  a 
robot. 

where  are  the  Imk  parameters,  and  are  the  joint 
paranieterSy  t<^ther  knowh  as  Dehavit-Hiutenberg  Parameters 
(Figure  1  shows  these  parameter).  The  degree' of  freedom  9,* 
will  be  equal  to  0i  if  the  ith  joint  is  levolute,  or  to' if  it  is 
prismatic.  ^  -  .  i  v  ;  , 

The  matrix  T  defines’ the  location  and  orientation’  of  the 
endneffector  of  the  robot  with  re^ct  to  itr  base,  that  is,  it  in¬ 
volves  a  3  X  3  rotation  matrix/ and  a  trandation  vector^’  There¬ 
fore,  this  matrix,  in  general,  spans  the  space  of  three  dimen¬ 
sional  displacements,  known' as  £uc/idean  (?nntpk£(3),' whidi  is 
a  obmpodtion  of  the  group  bf  rotations  ^0(3)  and  the  group  of 
trandation8T(3),therefore- 

15(3)  =  50(3)  X  T(3).  '  (3) 


where  the  Q  denotes  the  derivative  with  respect  to  t  and  d/|^  = 
d//dr.  Note  that  now  we  have  transformed  the  problem  into 
the  velocity  domain,  that  is,  the  mapping  d/|x  relates  the  points 
in  the  tangent  space  TAf  of  the  joint  space  to  the  tangent  space 
TN  of  the  configuration  space  of  the  end-effector,  d/|x  :  TM 
TN. 

Both  Eqs.  (4)  and  (5)  represent  the  inverse  kinematics  prob¬ 
lem.  Given  a  path  in  y(t)  in  the  configuration  space,  the  goal 
of  inverse  kinematics  problem  is  to  find  z(t)  for  all  t.  However, 
at  some  locations,  the  mapping  /  becomes  singular,  that  is  (y) 
may  not  be  in  the  image  of  d/  as  it  is  not  of  full  rank  at  sin¬ 
gularities.  Our  goal  is  to  find  x{t)  such  that  it  does  not  pass 
through  any  of  the  singularities. 

Therefore,  we  state  the  problem  of  inverse  kinematics  as 
follows: 

Given  y(t)  €  iV“,  find  a;(t)  =  /“^(y(t))  6  Af  \  5  for  all  t,  where 
M\S  is  the  joint  space  excluding  the  singular  configurations  S 
defined  as 

5  =  {x  €  Af  I  rank  (d/k)  <  n}. 

3  Inverse  Kinematics  as  a  . Control  Prob¬ 
lem' 

We  transform  the  above  mmtibnbd  invem  kineniarics  problem 
into  a  control  problem  which  eyentu^y  )deldsrthe  .dedgn  cri¬ 
terion  we  have  b^ looking  for.., Moreoyer,  this  appro^  also 
results  in  a.  control  that  avoids  angularities,  though  the  topic  of 
this  paper  is  just  to  present  the  design  criterion.  - 

We  first  define  tl^  functions  that  are  needed  in  the  follow¬ 
ing  theory: 

1.  a  scalar  function  or  a  parameter  p  ;  Af  -♦  [0,oo)  with 
p(ar)  =  0  if  and  only  if  x  €  S; 


This  group  of  spatial  displacements  can  be  parameterized  by 
six  independent  parameters;,  typically  by  three  Mgles  of  rota¬ 
tion  about  independent. axes  and  three  trahriation  parameters, 
and  hence  represent  a  6- dimenrional.  manifold,  ;^owu  as  config¬ 
uration  manifold  of  E(S)y  oi  uk  this  ca«  amf^ntion  manifold 
of  the  end-effector.  The.  functions  that  define  jhe  di^Iac^ent 
matrix  are  infinitely  diffef^tiable  with  respect  to  the  parame¬ 
ters,  therefore  the  manifold  is  smooth. 

A  set^th^  ^  both  u  group  wd  a  smooth  manifold  on  which 
composition  and  inveraon  appear  as  smooth  maps  is  a  Lie 
group,  ■  The  configurition  inanifold  bf  the  ehd^ectbr  is  a  group 
and  a' mooth  manifold,  mbreovex,  the  oompbatioh  arid  inver- 
aori^aie^defin^  in  texixm  of  the  matrix'  multiplication  arid  ma¬ 
trix  inversiori  as  smooth  iriaps,  hence"  it  fbnrik  ai  cdm^ci'  Lie 
‘Samuel  et^al  ;(l99i)  for  more 'det'^^^  To'be  g^m- 
er^  arid  'to'  iridude  the  tasea^'^bf  i>lariar,''1^hericd/  ’brid  spatial 
robotis;  we'note  that,  the'  of  riny 

dizderisibu,'  riot  hecessaiily  lieV'the  dml^atibri  &adifbid 

be  «“dimenfflqriai  and  •tt;'d^pt'ed*l)y  '  joint 

spa«  forms  ah  m-^enribhal  ma^  dehOted 'by  ,  also  a 
conipact’’‘l4e'''grbup;'  ■■■'  ■■ 

^  Therefore'^  ^the' kineiri^^  e^atibris  (Eql  1)  is  a  mapping 
from  Ad  fb’Wj  Or’spiboUcaUy,^^^"  ^  f' 


y^fix)-  ‘  (4) 


where  y  denotes  the  position  of  the  end-effector,  and  x  denotes 
the  values  of  the  joint  degrees  of  freedom.  The  mapping  /, 
known  as  forward  kinematics,  relates  the  joint  variables  to  the 
end-effector  porition,  and  its  inverse  inverse  kinematics, 
maps  the  position  of  the  end-sector  to  the  joint  variables.  As 
is  very  well-known  in  the  robotics  literature,  the  forward  kine¬ 
matics  is  single- valued,  whereas  the  inverse  kinematics  is  not, 
due  to  the  non-linearity  of  the  equations,  therefore,  f  :  M  N 
is  an  onto  mapping. 

We  differentiate  Eq.  (4)  with  respect  to  time  t  to  obtain 


y  =  d/Ui 


(5) 


2.  a  vector  lifting  function  A  :  (Af  \  5)  x  TN  -*■  TM  such 
that 

(a)  A(x(f),y(f))  is  a  right-invariant  vector  field  on  Af, 
and 

o>) .  . . 

^  '  .  J6) 

3.  B(f)  :  [0,oo)  — ►.TAd  suA  tha^‘ 

(a)  B(t)  is  a  right-invariant  vector  field  on- Ad,  and  ^ 

(b) '  -o.-.;  ^ 

^  H(t)U(t)ekcr(^l^q)-^^^^  ^ 

The  right-invariance  of  the  vector  .fields^  At  and  .  is  not  nec- 

ess;^.  for  T^e  proofs  of  the  theoremS'presented  m 
However,'  they,  are  useful  in,  deyd^ii^ :^e  a^nal  ront^^ 
trades  angularity-free  trajectories,  which  is  the  subject  of  a'  fo- 
turepaper..;  -^-7;  i  • 

The  inverre  kineinatics  problem  is  to  be  able  to  identic  x(f) 
at  every  inrtwt,  therefore,  x(t)  can  bq  expressed  as  .. 

.  \  ,  (8) 

Substittiing  for’ X  from,  this  equation  in  the  right  hand  side  of 
Eq.  (5),  we  sbe  that 

d/|xx(t)  =  d/UA(x(t),  y(t))  -I-  d/UH(0 

which  can  be  amplified  using  the  definitions  of  the  functions 
(Eqs.  6  and  7)  to 

d/|xx(t)  =  y(t)  -f  0. 

Therefore,  the  solution  x(t)  of  the  differential  equation  (8)  is  in 
fact  the  solution  of  the  inverse  kinemaitics  problem  specified  in 
Eq.  (5). 

Our  goal  in  solving  the  inverse  kinematics  problem  is  to  find 
the  control  jB(t)  such  that  the  solution  x(«)  6  M  \  5  starting 
from  x(0)  =  /~^(y(0))  for  all  t. 


Examples  of  Choice  of  p  and  A 
Example  I:  Deime 

p(z)  =  det(J(t)J^(x)). 

A(i,y)  =  J+y, 

m  = 

where  and  J  is  the  Jacobian  of  the  robot.  Note 

that  the  scalar  function  p  is  zero  only  when  x  ^  S  and  otherwise 
is  positive.  The  vector  function  A  satisfies  its  properties  as  well. 
This,  as  a  matter  of  fact,  is  Yoshikawa’s  algorithm. 

Example  2:  The  following  gives  a  more  computationally  effi¬ 
cient  method  of  finding  p  and  A: 

•  define  p(z)  =  where  Ux  is  the  smallest  eigenvalue  of 
JJ^; 

•  define  A  by  solving  the  equation  d/|x(A(x,y))  =  y  by  sin¬ 
gular  value  decompodtion. 

p  is  now  only  continuous,  not  smooth.  A  is  the  same  as 
before,  but  Jipweyer  this  dafinitinn  does  not  require  the  compu¬ 
tation  of  the  mat^  V'*'  ,  and  is  deiced  even  when  x  €  S, 

4  Conditions  for  the  Existence  of 
Singularity-Eree  Paths 

The  following  theorem  lists  the  conditons  on^the  existence  of 
singularity-free  paths  for  a  pven  design  of  robot:. 

Theorem  1  Let  f  :  M  N ,  p  :  M  [0,  oo),  end  A  as  before, 

and  assume  that  f  satisfies  the  following  three  conditions: 

1.  <r  >  0; 

t.  there  is  an  integer  p  suck  that  xoAf  (y)  =  p  for  y  &  N; 

3.  there  is  an  open  neighborhood  W  of  the  identity  element  of 
M  such  that  for  any  y^N  and  any  x  ^  M,  the  translate 
xW  ofW  intersects  M(y)  in  at  most  OTie  point; 

where  a  and  M(y)  are  defined  by 


Figure  2:  Schematic  of  a  four  jointed  spherical  wrist. 

Therefore,  th^  conditions  can  be  conddered  as  the  criterion  to 
evaluate  .the  dedgn  of  an  existing  robot  or  as  the  dedgn  criterion 
to  be  nse^  iri  the  proo^  of  dedgning  a  new  r^ot. 

Tn  practipe,  the  hypotheses  of  Theorem  1  are  t<M.mdrictive 
for  t^ic^  real  robots;  they  can,  however,  be  relaxed  somewhat 
as  we  show  in  the  fdiowing  Theorem. 

Theorem  2  Suppose  Af  =  Afi  U  Af2  vjith  p  constant  on  and 
/(Afj)  contained  in  a  submanifold  of  N  of  dimension  at  most 
(m-i).  Write  Nx  =  /(Afi).  Define  a  as  before  and  assume 

1.  a  >0; 

2.  xoAf  (y)  =  p  for  all  y  €  Ni; 

$.  there  is  an  open  neighborhood  W  of  the  idenitity  element 
of  M  such  that  for  any  y  ^  Ni  and  x  ^  M,  the  translate 
xW  of  W  intersects  M{y)  in  at  most  one  point. 

Then,  given  y{t)  and  €>0,  there  is  a  control  vector  B(t)  such 
that  ifxo  €  M(f(x)),  the  solution  to  Eg.  (S)  satisfies  p{x(t))  > 
(1  —  €)a  for  c/ft. 


igr{  sup  p(p} 
»(») 


and  ■ 

-  .s?P  KO)  .;;: 

. 

for  each  y  €  iV,  and  where  iro(Af(y))  is  the  number  of  path 
cdmpondnts'ofthe$et'M(y),  .  .  '  i  vrh'. 

Let  y :  [0,oo)  — ►  TN  be  a  ixmUnuous  fuiictioiC^Tii^  given 
any  c  >  0,  'ih!en'  is  a  l^^  vector  B{t)  swH  ikai  if  xq  is 
in  M{f{zo))  -then  the  solution  to  Eg.  (8)  sdti^a  p{z{t))  > 
{l^€)a,forallt.  •  ' 


.  The  first  ,  of  the  conditions  is  a  necessary  condition  ior  the 
existence  of, a  singularity-free  solution.,  a  will  be  strictly  greater 
than  .0  if  and  only  if ,  for  every  y  €  iiV,,/T^(y) -contains  points 
lying  outdde  the  dgular  set  5,  as  at.  the  singularities  p  is  iden¬ 
tically  equal  to  .zero  as  per  its  definition.  The  second  condition 
requires  that  the  number  of  maximal  points  of  p  . restricted  to 
/"'(y)  be  constant  for  all  y  €  N.-  This  number  is  the  number 
of  paths  ayaU^le  as  the  nondngular  paths.  Finally,  the  third 
condition  guarantees  that  these  different  nonsingular  paths  axe 
nonintersecting  which  guarantees  that  there  is  tubular  range 
around  each  of  these  paths  that  are  nondngular. 

When  all  these  conditions  are  met,  the  theorem  states  that 
starting  from  xq  €  /”^(yo)  one  can  follow  a  path  arbitraily 
close  to  p(x)  =  c  for  all  y  €  Af.  A  brief  proof  is  presented  in 
the  Appendix  A  (see  Sardis  1988  for  a  more  rigorous  proof). 

Note  that  when  a  robot  satisfies  these  conditions  for  any 
arbitrary  dioice  of  the  functions  p  and  A,  the  robot  is  guaranteed 
to  follow  any  trajectory  in  its  workspace  avoiding  singularities. 


Notice  that  this  theorem  is  a  dight  modification  of  the  eariier 
one.  The  second  and  third  conditions  deal  with  only  part  of  the 
configuration  manifold  of  the  end-effector  instead  of  the  .whok 
manifold,  that  is,  Ni  instead  of  N.  This  is  posdble  because  the 
parameter  p  is  constant  on  Af2.  ’  We.  do  not  present  the  proof 
for- this  theorem  in  tUs  paper,  however  it  is  dmilar  to  the  proof 
of  Theorem  1.  The  following  examples  demonstrate  how  this 
theorem  can  be  used..  ' 

5  An  Example 

Figure  2  shows  a  four  jointed  ^herical  wrist.  Note  that  the 
four  axes  intersect  at  one  peunt  which  is  known  as  the  wrist 
center  point  (WCP).  Therefore,  the  last  link  (the  end-effector) 
only  rotates  (does  hot'  translate)  with  respect  to  the  first  link 
(the  base).  The  configuration  manifold  is  hence  50(3)  and  is 
of  3  dimensions.  The  transformation  that  denotes  the  position 
of  the  end-effector  relative  to  the  base  is  given  by 


COS^i 

—  sin  0 

■  1 

0 

0 

sin^i 

cos^i  0 

0 

cos  ^2 

-  sin  $2 

0 

0  1  J 

0 

sin  ^2 

cos  ^2 

cos  03 

0 

sin  Oz 

COS^4 

—  sin  64 

0  ' 

0 

1 

0 

sin  ^4 

C0S^4 

0 

-  sin  ^3 

0 

cos  ^3 

L  0 

0 

1 

The  right  hand  side  of  the  above  equation  denotes  the  mapping 
f:M^N  where  z  =  (^1,^2,  ^3,  ^4). 

We  now  need  to  define  the  parameter  p,  it  has  to  be  positive 
valued  everywhere  in  M  except  at  the  singularities  where  it 


vanishes.  Singularities  occur  for  this  robot  when  two  pairs  of 
joint  axes  line  up,  that  is  when  TN  loses  its  rank  from  3  to  2. 

We  will  use  a  different  parameter  than  those  given  by  earlier 
examples.  Choose  unit  vectors  01,02,03,  and  04  parallel  to  the 
joints  of  the  wrist,  and  define 

p(i)  =  majc(|«i  X  03I,  Im  X  (kI)  (10) 

where  e,',>  =  1,2, 3, 4  are  fdndtions  of  x.  Note  that  this  ex- 
pression  becomes  zero  only  when  oi  is  parallel  to  03  and  02  to 
04,  otherwise  it  is  equal  to  the  maximum  of  the  absolute  values 
of  the  sines  of  the  angles  between  the  two  pairs  of  the  axes. 
This  is  the  same  condition  for  the  singularities  as  the  four  axes 
becoming  coplanar. 

For  a  given  y  and  a  value  of  $1,  the  inverse  kinematics  prob¬ 
lem  yields  two  solutions,  analogous  to  el  bow- up  and  elbow-down 
configurations  of  a  three  link  manipulator.  Therefore,  as  we  vary 
.through  its  range,  the  inverse  kinematics  solution  of  the  four 
d^ree  of  freedom  spherical  wrist  are  obtained  as  two  curves  in 
the  joint  i^ace,  both  parameterized  by  0i .  Ot^ese  are  al^  known 
as  fibers  associated  with  the  mapping  /"'  :  N  hi.  Note 
that  attains  maximum  when  the  an^es  between  01,03,  and 
02,04  become  i^j  whi^  implies  that  Hcn«;  thc*fim<i- 
tion  >(*)  remains  cbnstjedxt  and  equal  to  1  if  oi  is  pv^d  to;  04, 
and  otherwise  attains  lour  local  maxima  on  eaui  fib«^  ' 
Now,  if  y  is  an  orientation  liaving  oi  paralld  ib  04  then 
iK/^^Cy))  cbnstajit'  and  nion-z^o,  therefore  we  '^define 

M2  -  x  a4==0}. 

Ml  ^  T  ' 

It  can  be  seen  that  p  has  a  constant  value  1  on  all  of  Af2,  and 
/{Mj)  has  the  dimension  1  as  it  is  the  set  of  all  rotations  about 
oi  or  04. 

fi  =  xq Af  (y)  is  the  number  of  local  maxima  for  all  y  6  iV”! 
which  is  four  on  each  of  the  two  fibers,  that  is  /i  =  8.  Finally,  if 
we  define 

=  {*  I  l^il  <  45Mtf2|  <  9(f } 

the  third  condition  of  Thereom  2  is  satisfied  as  well.  There¬ 
fore,  according  to  Theorem, 2,  given  any  smooth  trajectory  y(t), 
we  can  track  y(t)  by  a'  continuous  change  !of  qonfigur^on  x(<) 
which  will  keep  p(z)  arbitraily  close  to  1,  that  is,  the  wrist  will 
never  encouter  , 

6  '  Conclusion 

In  this  paper, -we  have' formulated  Uie  -inverse  Idhematics  probr 
lem  in  ^  controls  setting.*  In  this  ^settiiig,'  bur  god  is  to  inves¬ 
tigate  if  the' redundant*  robot  *ciui'tra^^  any  trajectory 'in  its 
workspace  avoiding  the  singularitiesi  Two' theorems  ^edfy^the 
conditions  that  ;^bb;iy  guarantee  the  existejice  of  such  paths 
regardless  of  the  path  of  the  end-effectOT  .under^  epnaderatiOn. 
We  have  also  applied  the  theorems  to  tie  foui>jcmW'^heric^ 
wrist  that  has  appeared^  the  literature  before. 

The  existence  of.  the.  control  vector  J?(t)  is,  guaranteed  to 
avoid  the  angularirie8,,howeyer,  we  haven’t  determined  the  con¬ 
trol  S(f)  actually  guides  the  robot  on  the  deari^d -trajec¬ 
tory,  This  will  be  the  topic  of  a  future  paper. 
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A  Proof  of  Theorem  1 

First  we  consider  the  following  two  Lemmas. 


Lemma  1  Suppose  F  is  a  smooth  manifold,  Xo(t),...,Xt(f) 
STTUMth  vector  fields  on  F,  and  assume  that  Xo(< 
span  the  tangent  space  T^F  for  all  x  e  F  and  for  all  values 
of  t.  Let  U  be  an  open  set  in  and  let  <ht  :  U  F  be  a 
continuous  one-parameter  family  of  open  maps.  Assume  Xq  € 
Then  there  are  continuous  control  parameters  uj{t)  sdeh 
that  the  solution  to 

x(0)  =  xo 

satisfies  x(t)  6  4h{U)  for  a// 1  >  0. 

Proof:  Without  loss  of  generality,  Aro(t)  =  0  rina.^e  Xj 
span  the  tangent  space  of  F.  Take  z>€  U  such  that  <^(z)  =  xq 
and  define  a  curve  a(f)  :=  ^(z).  Then  ot(t)  is  a  continuous 
curve,  with  a(t)  €  4h(^)'  Since  ^(U)  is  open  and  ^(z)  is 
continnous  in  f,  a  can  be  approximated  by  a  smooth  curve  x(t) 
with  x(0)  =  Xo  and  x(0  €  ^(U).  Define  Ui(t)  :=  X,*(<)(x(t)); 
then  x{f)  =  since  Xi  span  r*F. 

The  connection  between  Lemma  1  and  Theorem  1  is  given 
by  the  foUow^  Lemma  and  Definition.  .  , 

Dbmma'  2  For  any  8  >  0  there  is  an  opm  neighborhood  V  'of  i 
in  Mf  such  that  for  driy  x  in  M  and  anyz  €  xV,  \p(x)^p(t)\  < 

h’-  1  ■ . 

*  -  Proof:  FSist,  for  every  x  €  Af  take  a  n^borhiiof  W*  of  1 
with  b(*)  -  P(x)l  <  S/2  whenever  z  this  is  p<»oble 

once  p  is  a  continouons  function.  Then  an  open 

cover  of  A/,  and  Af  is  compact;  so  there  is  a  finite'lrabcover 
{xiWi,...,XiWi;}.  Define  V  :=  Wi f) , . . f| W*.  Now  take  an 
X  in  Af  and  a  point  z  in  xV.  x  €  xjWj,  for  some  j;  and  so 
z  €  (xjWj)V  <  xj(}Vj)\  Then 

\pix)  -  p(z)  <  1p(x)  -  p(x,*)|  |p(x,)  p(z)l  <  8/2  +  8/2  =  8. 

Let  f  :  M  N  and  A  be  as  before  with  7  a  smooth  path  in 
N.  Pick  a  time  t  €  [0,oo)  and  let  F  be  a  subset  of  /“'(t)* 
Assume  ,  that  F  does  not  contain  any  singnlar  points.  Let 
/(  be  a  subinterval  of  (0,oo)  containing  f.  Then  if  /<  is  small 
enough,  for  any  x  €  F  we  can  lift  7[/iJ  to  a  path 

^(0)  =  X  *  V 

4x(0)  =  A(7x(0,7(<p. 

This  gives  a  ^  :  F  x  /<  -►  Af  by  ^x,s)  =  7^(s).  Denote 
the  image  of  rf)  by  Tt. 

Definition  1  Tt  is  said  to  he  a  tubular  interval  F  (unih 
Respect  to  f,Xf  and  'if)  if  iff :  F  x  It  ^  M^  is  a  homeomorphism. 

Proof  of  Theorem  1:  It  is  suffiqent  tq  prove  the  wait  for 
tm  ^atflutr^  qpnipact  intw  v  V  ^ 

.V  \  t:€  lb,r],x  e  A^y(0),:th^^i8^  4 

of  all  ina^al  pointe  of  p(i)  in  the,  fibers,  overly  in  >th€,  space 
if  X  [0,r].  if  is  a  closed  set.  Define  projj  :  Af  X  [Q^r] 
by  projection. 

Suppore  -F,*  is  a  connected  component  of  K;  by  the  third  con¬ 
dition  of  the  theorem,  projj  is  one-to-one  bn  Af,*,  once 
ojvers  Ki:  Also,  AT,*  is  closed  since  AT  is.  So  proj2  gives  a  p-fold 
cover  of  [0,f  by  closed  sets.  ^  ' 

Now  let  Kq  be  the  componant  of  AT* -containing  the  point 
(xo,0);  we  want  to  show  that  projjCATo)  is  all  of  [0,r].  So, 
suppose  that  proj2(A:o)  =  [0,  tq]  for  some  tq  <  r.  Define  Ki  to 
be  the  umon  of  all  other  path  componants  of  K  containing  a 
point  in  /~'(ro)x{To}.  But  then 

Ri  :=  projjCAfo) 

R2  proj2(ii:\A:i)\(ro,r) 

axe  disjoint  dosed  sets  whose  union  is  all  of  [0,  r];  so  since  [0,r] 
is  connected,  projjCATo)  is  all  of  [0,r]. 

Now,  since  proj2(Aro)  =  (0,r]  we  can  define  a  continuous 
funcUon  q  :  [0,r]  Af  by  the  relation  (a(t),t)  €  Ko  for  t  € 


We  now  define  a  set  and  collections  of  smooth  manifoids 

Fi  and  open  maps  4>\:U  fi,  and  apply  Lemma  1, 

First,  by  Lemma  2,  choose  a  ndghborhood  V  of  1  in  Af  such 
that  for  any  x  e  M,  \p{x)  —  p(z)|  <  an  whenever  z  €  iV.  In 
particular,  if  z  €  o(f)V’  for  any  t  then 

pW  >  P(a(0)-««' 

>  (7  —  ca 

= 

Let  17  be  an  open  set  with  1  €  U  and  the  closure  of  U  contained 
in  V. 

fbr  every  t  6  [0,t]  choose  an  open  interval  It 

containing  t  such  that  It  defines  a  tubular  interval  Tt 
about  a(t)V’n/'’^(y(0)»  suffidently  small  that 

Is  for  all  s  €  It-  Then  It  is  an 

open  cover  of  the  compact  interval  [0,r;  take  a  finite  sub  cover 
JtJk,  with  0  =  to  <ti--<h  =  T, 

Define  Fi  :=  a(<t)V' fl /"HyM)  i  =  0,1, Also, 
define  4i:U  Fi  for  t  6  [tt,tt+i]  by  4i{w)  :=  gi{a{t)v))  where 
Pt(<)  :  Tti  -►  /“Hy(0)  t)y  projection  in  Tt,-,  for  t  €  [ti,ti+i]. 

R)r  t  €  define  Aro(t)  on  Fi  to  be  the  projection  of 

d^,( A).  Pick  vector  fields  Xi , . . . ,  X*  spanning  Fi, 

Now  define  z(t)  :=  ji(ti)x(t);  then  z(t)  is  a  function  with 
image  lying  in  Fi  for  t  €  By  Lemma  1  there  are  scalaras 

t*i , , . . ,  «jt  such  that  the  solution  to  i  =  Xo  +  ]C  inside 

4(U)  for  aU  t  €  Define  B(t)  €  r(a(t)Vn /-*(»(<))) 

by  _ 

Then  the  solution  i(t)  to  the  equation  x{t)  =  A(i(t),y(t))+jB(t) 
satisfies  Pi(i(0)  €  ^{U)  for  all  t  €  or  equivalently, 

x{t)  €  a(t)U 

for  ail  t  €  Thus 

p(«(0)  >  (1  - 

for  all  t  €  Since  this  can  be  done  for  each  t  = 

0,1,... ,  k,  we  have  p(x(t))  >  (1  —  €)<t  for  all  t. 
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Coordinate  System  Independent 
c  Lee  Form  of  Instantaneous  Invariants 

A.T.  Y,„,  in  Spatial  Kinematics 

This  paper  presents  explicit  equations  for  the  instantaneous  invariants  of  spatial 
kinematics  that  are  coordinate  systems  independent.  This  eliminates  the  ne^  for 
laborious  coordinate  transformations  necessary  in  the  determination  of  the  canonical 
coordinate  systems.  It  provides  for  a  direct  method  for  the  circulation  of  instan- 
taneous  invariants  based  on  information  specifying  the  rigid  body  motion  in  any 
arbitrary  task  coordinate  systems.  Explicit  equations  are  presented  for  instantaneous 
invariants  up  to  the  third  order  for  spatial  and  spherical  motions,  and  up  to  the 
fourth  order  for  planar  motions.  The  results  provide  for  a  useful  tool  in  design  and 
analysis  of  mechanisms  and  motions  based  on  instantaneous  invariants.  Examples 
are  presented  to  illustrate  the  theory. 


B.  Ravani 

Computer  Integrated  Design  and 
Manufacturing  Laboratory, . 
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Introduction 

Instantaneous  invariants  and  canonical  coordinate  systems, 
since  their  introduction  by  Bottema  (1961)  and  Veldkamp  (1967 
and  1976),  have  become  a  powerful  tool  in  differential  kine¬ 
matic  an^ysis  and  synthesis.  Roth  and  Yang  (1977)  auid  Gupta 
(1978),  among  others,  have  applied  them  to  analysis  and  syn¬ 
thesis  of  planar  mechanisms..  Kirson  and  Yang  (1978)  as  well 
as  McCarthy  and  Roth  (1982),  Nayark  and  Roth  (1981)  have 
extended  the  concept  of  instantaneous  invariants  and  canonical 
coordinate  systems  to  a  study  of  spatial  rigid  body  motions 
and  mechanisms.  A  detailed  account  of  the  concept  of  in¬ 
stantaneous  invariants  and  the  canonical  coordinate  systems 
for  planar,  spherical  and  spatial  motions  can  be  found  in 
Bottema  and  Roth  (1979).  In  robotics  literature,  StaniMc  and 
Pennock  (1986)  have  ^plied  the  concept  to  the  study  of  dif¬ 
ferential  motions  of  robot  manipulators. 

All  existing  work  on  determination  of  instantaneous  invar¬ 
iants  require  the  use  of  the  so-called  canonical  coordinate  sys¬ 
tems.  This  usually  involves  tedious  coordinate  transformations 
necessary  to  represent  the  specified  quantities  in  the  canonical 
coordinate  systems.  In  this  paper,  we  present  concise  expres¬ 
sions  for  the  instantaneous  invariants  using  an  arbitrary  set  of 
moving  and  fixed  coordinate  systems.  This  eliminates  the  need 
for  the  computations  associated  with  the  use  of  the  canonical 
coordinate  systems.  The  derivations  are  based  upon  the  concept 
of  kinematic  mapping  introduced  by  Ravani  and  Roth  (1984). 
A  rigid  body  motion,^  specified  in  any  coordinate  system,  can 
be  mapped  into  an  image  curve  in  the  space  of  the  mapping. 


‘in  this  work  wc  arc  only  considering  single  degree  of  freedom  motions. 
Contributed  by  the  Mechanisms  Committee  and  presented  at  the  Design  Tech¬ 
nical  Conference,  Chicago,  IL,  Sept,  16-19,  1990,  of  The  American  Society 
OF  Mechanical  Engineers.  Manuscript  received  March  1990. 


The  mapping  is  defined  such  that  the  intrinsic  properties  of 
the  image  curve  are  invariant  with  respect  to  coordinate  trans¬ 
formations.  This  property  then  enables  us  to  calculate  the 
instantaneous  invariants  directly  from  the  intrinsic  properties 
of  the  resulting  image  curve.  Although  the  intrinsic  properties 
of  an  image  curve  can  be  used  to  charaaerize  the  corresponding 
rigid  body  motion  instantaneously  (see  McCarthy  and  Ravani, 
1986),  the  use  of  the  instantaneous  invariants  has  been  (at  least 
up  to  the  present)  more  familiar  to  kinematicians.  This  is  why 
we  are  presenting  the  explicit  expressions  for  the  instantaneous 
invariants.  The  main  contribution  of  this  work  is  therefore  in 
providing  a  set  of  explicit  expressions  for  computations  of  the 
instantaneous  invariants  of  a  given  rigid  body  motion  without 
the  need  for  the  use  of  the  canonical  coordinate  systems.  This 
is  useful  for  design  and  analysis  of  mechanisms  as  well  as 
comparing  rigid  body  motions.  Expressions  are  presented  for 
the  instantaneous  invariants  of  spatial  and  spherical  kinematics 
'  up  to  the  third  order  and  for  the  planar  motions  up  to  the 
fourth  order.  Numerical  examples  are  used  to  illustrate  the 
results. 

The  organization  of  the  paper  is  as  follows:  first,  we  give  a 
very  brief  overview  of  kinematic  mapping  and  derive  the  gov¬ 
erning  equations  in  terms  of  the  3  x  3  dual  matrix  represen¬ 
tation  of  spatial  displacements.  We  then  derive  the  differential 
properties  of  the  image  curve  of  a  motion  in  terms  of  the 
differential  values  of  the  elements  of  the  3  x  3  dual  matrix 
representation  of  spatial  displacements.  This  enables  us,  in  a 
subsequent  section,  to  directly  arrive  at  the  expressions  for  the 
dual  instantaneous  invariants  (up  to  the  third  order)  that  are 
independent  of  the  canonical  coordinate  systems.  We  then 
provide  two  examples,  an  epicyclic  hypoid  gear  train  and  a 
RCCC  mechanism,  to  illustrate  the  theory  for  spatial  motions. 
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The  spherical  results  can  fall  directly  from  our  spatial  results 
using  the  principle  of  transference  (Kotelnikov,  1895,  and 
Study,  1903).  We  have  therefore  eliminated  any  further  dis¬ 
cussions  for  spherical  motions.  In  the  case  of  planar  motions, 
we  specialize  the  mapping  and  develop  the  results  with  the 
same  detail  as  in  the  general  spatial  case.  Since  the  mathematics 
is  less  tedious  in  this  case,  we  develop  the  instantaneous  in¬ 
variants  up  to  the  fourth  order  and  use  a  numerical  example 
for  illustrative  purpose. 


Notations 

A  dual  number  is  denoted  by  an  angle  symbol  on  top  of  a 
letter.  For  example,  3  =  b  +  e  b  in  which  b  andja  are  referred 
to,  respectively,  as  the  real  part  and  dual  part  of  The  symbol 
€  designates  the  dual  unit  with  the  property  ^  =  0.  In  the 
study  of  instantaneous  kinematics  we  are  interested  in  the 
motion  of  a  rigid  body  at  a  particular  instant,  referred  to  as 
the  zero-position  (Veldkamp,  1967).  The  order  of  the  time 
derivative  of  functions  evaluated  at  zero  position  (t  =  0),  is 
denoted  by  the  last  subscript.  For  example  R2  =  R(0)  =  [A"i2, 
X-a,  Xi2,  032  =  ^3(0),  a2i  =  «2(0),  etc.  The  value  spec- 
ified  in  canonical  system  is  denoted  by  a  tilde  symbol  under 
a  letter  such  as  225. 


Kinematic  Mapping 

According  to  Chasles  theorem,  the  displacement  of  a  rigid 
body  can  be  uniquely  desOTt^d  by  rotating  a  dual  angle  ^ 
about  a  screw  axis  S  =  (S,,  Sy,  S^).  The  rigid  body  displacement 
can  then  be  mapped  into  a  point  in  the  dual  image  space  E 
(see  Ravani  and  Roth,  1984).  The  coordinates  of  the  point  are 
defined  by 

R^{X,JC2JCyJC^) 

=  ^S;^ini,S,Sin|,SzSin^,Cosij  (1) 


Consider  a  rigid  body  in  continuoi^  motion  in  Euclidean 
space.  In  such  a  case,  the  screw  axis  S  and  the  dual  angle  ^ 
are  functions  of  time  t.  Equation  (1)  will  then  represent  a  du^ 
curve  in  terms  of  the  real  parameter  t  in  the  dual  image  space 
£.  We  refer  to  the  image  curve  R(tyas  a  dual  image  curve  in 
the  dual  image  space  £. 

Let  SK  be  an  arbitrary  chosen  coordinate  system  attached 
to  the  moving  body  and  ^  be  a  system  fixed  in  Euclidean  space. 
The  transformation  from  911  to  9,  which  describes  the  rigid 
body  motion,  may  be  expressed  as  a  dual  matrix  {V eldkamp, 
1976) 


7l 

A==  0:2  p2  72 
as  ^3  73 


where  the  dual-elements  are  continuous  functions  of  the  real 
parameter  t. 

Using  Rodrigues  formula,  we  may  show  that  the  dual  angle 
^  and  the  screw  axis  S  =  S^)  may  be  expressed  as  the 

elements  of  the  dual  matrix  A: 


Cos^=^(a,-h.32+73-l),  Sx=^(^3-72), 


Substituting  Eq,  (3)  into  Eq.  (1),  we  have  the  parametric 
form  of  the  image  curve"R(t)  written  in  terms  of  the  elements 
of  the  dual  matrix  A: 


Differential  Properties  of  Image  Curves 
In  this  section  we  review  the  differential  properties  of  an 
image  curve  which  are  independent  of  the  coordinate  systems 
in  Euclidean  space.  These  geometric  quantities  are  then  used 
to  derive  the  explicit  expressions  of  the  instantaneous  invar¬ 
iants.  The  formal  proof  of  the  independent  coordinate  prop¬ 
erties  of  the  kinematic  mapping  is  given  m  Ravani  and  Roth 
(1984).  For  a  description  of  the  differential  geometry  of  the 
image  space,  readers  may  refer  to  McCarthy  and  Ravam  (1986) 
and  McCarthy  (1986  and  1987). 

Using  prime  to  denote  differentiation  with  respect  to  the 
dual  arc-length  we  may  write  the  Frenet  formulas  for  the 
base  frame  (t,  N,  B,  E)  as  . 

T/1  [0  K  0  -il  Ft 

N'  _  -ic  0  f  0  N 

B'  "  0  -T  0  0  .  B 

,E'J  [1  0  0  0  J  [e  ■ 

With  the  aid  of  Eq.  (5),  we  obtain  the  derivatives  of  the  image 
curve  R  with  respect  to  t,  namely: 

R  =  E 

R=yt 

(6) 


R  =  [^ *-  ( 1  -I-  -H  (3ui)/c  +  y^«)N  +  -  3vvt 

where  dot  denotes  differentiation  with  respect  to  the  parameter 

t  and  y  =  q  = 

From  Eq.  (6)  we  observe  that  the  dual  functions  v,  k  and 
T  characterize  the  differential  geonietry  of. R(t)- Using  the  no¬ 


tations 


R(0)  =  Ro, 


iK0)^Vo 


-  Vn  etc. 


We  may  write  the  Taylor  expansion  of  the  column  veaor  R(t) 
at  a  reference  point  (t  =  0)  as 

.  -  .  t^  ^  t^ 

R(t)  =  Ro  +  Rit  +  R2  + B3  ••••• 


Using  Eq,  (6)  the  above  equation  may  be  written  as 
0  &o  +  ^ 

0  0  3&o^&i  +  i^i 

0  ^  0  0  2'^  6 

1  0  -€o  L  -Svoh 

The  above  equation  is  an  intrinsic  representation  of  the 
image  curve  up  to  the  third  order.  The  values  of 
To,  Ki  in  the  above  equation  can  be  written  in  terms  of  the  dot 
and  the  wedge  products  of  Rj(j  =  0,  1,2,  3),  namely 
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i)o  =  (Ri*Ri) 

Ri*R2 


t;j=“ 


Vo 


V2  =  - 


Ri*R3  +  R2*R2  (Rt*R2)"' 


1^0 


Vo 


Ko=- 


[(Rj  aR2A]R^)  •  (R]  aR2ARq)] 


1/2 


(8) 


^0 


^  ci€t(R}  ,R2,R3,Ro) 

Jq  = 

^  {RiAR2ARo)*(Ri  AR3AR0)  ^  Vi  ^ 
- 3  —kq 

VqKo  Vq 

where 

(RiAR2ARo);„*(Ri  AR3AR0);,  =  M  123^123  +  ^ZiA^23A 

+  A5'^4A?i34  +  M^4Mi24 


are  the 


minors  of  the  matrix  [Rj,  R2,  Ro][^  and  matrix  [Ri,  R3,  Rq]^ 
respectively,  and  det  (ft,,  ft2,  fts,  fto)  is  the  determinant  of  the 
matrix  [ft,,  ft2,  fts,  fto]  . 

The  detail  of  operations  of  wedge  product  in  general  form 
can  be  found  in  Flanders  (1963)  and  McCarthy  (1987). 

The  values  of  ft„  can  be  determined  by  differentiating  Eq. 
(4)  with  respect  to  t  and  evaluating  them  at  t  =  0.  We  show 
the  formulas  for  n  -  0,  1,  2,  3,  4. 


Xn  Xji  Xkx 

Xn  Xi, 

Xkx 

II 

^2  ^*2 

and  33'5*= 

^,3  Xfl 

x^ 

XtoXjaX^ 

Xio  Xjo 

(9) 


■3A4.^a-3A-«A-„-X43Jf» 


Xi,=^l^-AX,iXn- 6X,i^Ca  - 


'X^U 


Xm  (“11 +321 +731) 


X42  -  (a  12  +  ^22  +  732  -  8^41 ) 


Xa2  ~  (tti3  +  023  +  733  “  24X41X42) 

^44  =  ^  (ai4  +  324  +  734-24;ej2~32;e4I^43) 


(10) 


Determination  of  Instantaneous  Invariants 

The  dual  matrices  B„  of  the  canonical  systems  in  spatial 
kinematics  were  derived  by  Kirson  and  Yang  (1978).  They  are 
the  transformation  matrices  between  the  moving  and  fixed 
canonical  systems. 


(11) 


go- 

1  0 

0 

0 

-  “21 

0 

0  1 

0 

i,= 

£21 

0 

0 

0 

0 

1 

0 

0 

0 

-  S22 

0 

II 

“22 

-k\^  - 

■^32 

0 

hi 

0 

B3 


(12) 


-3a2lS22  ”^23 

£23  “3a2l£22 

‘2l3  +  3a2i^32  ^33 


2  13 

-I33 

0 


where  a  2i>  £  22*  £  23*  k  32»  ^  33*  X  i3  instanta¬ 

neous  invariants  of  a  given  spatial  rigid  body  motion  at  the 
zero  position  and  up  to  the  third  order. 

^  Substituting  Eqs.  (11)  and  (12)  into  (9)  and  (10),  we  have 
Eo  =  [0,0,0,  Ij^and 


p 

- 

- 

0 

\hi 

2  ^33 

0 

0 

1  .  3  .  g 

2  213-4  S21P32 

1  ^ 

2  “21 

,  R2  = 

1  . 

,  §3  = 

>  *  3 

2  223+1  S21 

\ - 

0 

[-422.^ 

3  .  . 

—  4  221SE22 

(13) 


Substituting  the  above  go,  gi,  ft 2  and  §3  into  Eq.  (8), 


where  i  =1,2,3  and  we  have  the  dual  instantaneous  invariants  in  terms  of  functions 

^  -  of'kn  {n  =  0, 1,2, 3)  in  an  arbitrarypair  of  coordinate  systems. 

^V-/33y~72/.  These  equations  are: 


V2J  =  yiJ-OC3J, 

/  4  \  1/2 

a  21  =  2  Xa) 

(14) 

j  =1,2,  3,4 

and 

“22=-:^  2  CcnXn) 

231  Jn 

(15) 
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^32=—  [(^723)'+  (^S4)'+  (Mw)^+  (A5^724)']'""  (16) 

2121 


S23=^  i];^;iA-s+  O’) 

a  21  [f^  '  /V ^ 


Mj2_Mh 
a  21  a  21 


Xu  Xi,  X,i  X,, 

V  =— li-  A  -  A  ns3 

e2ll32  ^13  ^23  Xj,  ^43  ^  2  ~ ^ 

-^10  -^20  ^30  -^40  I 


^33-  ^  2  S  (^123-^723 +  ^?34-/\^234 
^  21  £  32 


+  M134M1 J4  +  A/j^4M"24  )  (19) 


Xi 

Xa 

Xju 

Xn 

Xfi 

Xiu 

Xa 

Xj2 

Xia  and 

^3 

Xfl 

Xti 

Xio 

^jO 

Xico 

^iO 

Xjo 

x^ 

+(A/r34)^+(A/r24)"]''^  (20) 


3  ^  2  det  (Ri,R2,R3»Ro) 

213=2 ^32 - ; /4  - 


^4  \  3 

S^n) 

./=1  / 


o  (  A/i23A/i23  +  A^S4A^234  +  Af r34A/i34  +  M^2^12A. ) 

£33= - 7*:^ - - 


2(1:4, 

\i*=  1 


3132  S  (^/lA-e) 

_ /=J _ 

/  4  \  3/2 


2 

Planet  gear 


1 

Sun  gear 


Equations  (14-19)  show  the  expressions  of  dual  instanta¬ 
neous  invariants  up  to  the  third  order  for  a  given  rigid  body 
motion  in  an  arbitrary  coordinate  system.  It  is  clear  that  if  the 
matrix  A  in  Eq.  (2)  is  given  then  R„  (/z  =  0,  1,  2,  3)  can  be 
calculated  from  Eqs.  (9  and  10).  The  instantaneous  invariants, 
up  to  the  third  order,  can  then  be  obtained  directly  from  Eqs. 
(14-19).^ 

For  the  case  of  a  spherical  motion,  the  image  curve  becomes 
a  real  image  curve.  The  procedure  to  derive  the  expressions  of 
instantaneous  invariants  for  spherical  motions  can  be  obtained 
by  removing  the  dual  components  as  stated  by  the  principle 
of  Transference  (Kotelnikov,  1895,  and  Study,  1903)  and  the 
results  are  as  follows: 

^32=~T7  7372  [(Af^)^+  (MJLj)^ 


Numerical  Examples 

Example  1.  Figure  1  shows  an  epicyclic  hypoid  gear  train 
consisting  of  a  pair  of  meshing  hypoid  gears  (fixed  sun  gears 
1,  and  planet  gear  2)  connected  by  a  twisted  rigid  member 
(carrier  3).  Determine  the  instantaneous  invariants  which  char¬ 
acterize  the  geometric  properties  of  the  motion  of  the  planet 
gear  2  up  to  the  third  order. 

As  shown  in  Fig.  1 ,  we  use  a  moving  coordinate  system  { b  ] 
which  is  attached  to  the  planetary  gear  2  and  a  coordinate 
system  (f)  fixed  to  the  sun  gear  1.  The  dimensions  of  the 
hypoid  gear  train  is  given  by  the  dual  angle  5  =  6  +  e  6 


Rg.  1  Epicyclic  hypoid  gear  train 

between  the  two  gear  axes  U  b  3.  The  line  of  contact  of 
the  two  hypoid  gears  forms  the  dual  angle  f  —  f  +  e  f  with 
f3  and  dual  angle  ^  -i-  e  ^  with  63.  6  and  ^  are  the  constant 

angular  velocities  of  the  planet  gear  and  the  carrier,  respec¬ 
tively. 

If  the  gear  ratio  =  2,  S  =  60  deg.  +  e  3,  the  zero 
Sini? 

position  ^0  =  00  =  0,  and  the  angular  velocities  0i  =  |r“  = 

oino 

0.37796  and  =  2  ((>[  (see  Kirson,  1975,  and  Hsia,  1979). 
The  dual  matrix  A  defining  (b)  relative  to  {f )  is 


C4>  -s<t>  0  10  0  Cd  -se  0 

k=  s<t>  c<t>  0  0  C5  -sS  se  ce  o 

0  0  1  0  s8  cS  0  0  1 


C(t>ce-s<i>secl  -C4>se-s<t>cecB  S(t>^ 

s<i>ce+Cit>secS  -s<i>se+C4>C9Cl  -c<t>sl  (23) 

cesS  cS 


where  Ce  =  Cosfl,  S<i>  =__Sin<#,,  etc. 

From  the  dual  matrix  A  in  Eq.  (23)  and  the  given  data,  we 
can  calculate  the  R„  (n  =  0,  1,2,  3)  from  Eqs.  (9  and  10). 
Substituting-the  values  of  R„(n  =  0,  1,  2,  3)  into  Eqs.  (14- 
19),  we  obtain  the  instantaneous  invariants  of  the  planet  gear 
as  follows: 

a2i  =  l-f  0.74231  ^32  =  0.24744  +  6  0.42861 

422=0  213  =  0.31813  +  6  0.23622 

223= -1.06122  +  6  1.96931  433  =  0 


The  above  values  are  in  agreement  with  those  given  in  Kirson 
(1975)  and  Hsia  (1979). 
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Y  Pig-  3  Arbitrary  fixed  and  moving  coordinates  and  SE 

Fig.  2  RCCC  mechanism 


Example  2.  The  RCCC  mechanism  (one  revolute  pair  and 
three  cylindrical  pairs)  is  shown  in  Fig.  2.  The  dimensions  of 
the  Imks  are  specified  by  four  constant  dual  angles  a,  7, 
and  6.  The  relative  positions  ^of  the  links  are  defined  by  four 
time  dependent  dual  angles:  part  is  constant  since  the 

input  member  is  a  revolute  pair),  §,  and  The  angle  <i>  is 
the  input  and  ^  the  output  of  the  mechanism.  Determine  the 
third  order  instantaneous  invariants  of  the  coupler  link  cor¬ 
responding  to  a  given  value  of  input  angle  <f>o. 

We  choose  the  body  system  { b }  attached  to  the  coupler  link 
and  the  fixed  system  {f }  attached  to  the  frame  as  shown  in 
Fig.  2.  The  displacement,  velocity,  and  acceleration  of  the 
coupler  link  as  functions  of  input  angle  <j>  are  derived  in  Hsia 
(1979).  The  numerical  zero-position  values  of  the  dual  param¬ 
eters  are  given  as  follows: 

6  =  30" +€2, 

§0-86.6" -€2.9591  00  =  0. 

§i=- 1.9917-6  2.3066  0i=  1.8155. 

§2  =  0.05274  -Kell  .8924  02  =  0.2443 , 

§3= -7.51-6  13.77  03  =  7.99. 


The  above  results  are  verified  by  using  different  coordinate 
systems  in  describing  the  dual  matrix  A. 


Instantaneous  Invariants  for  Planar  Motion 

^  For  the  case  of  planar  motions,  the  components  of  R(J?i, 
will  become  pure  dual  numbers.  The  determination  of  the 
explicit  expressions  for  instantaneous  invariants  needs  some 
special  treatment.  It  means  that  we  have  to  deal  with  the  real 
part  and  the  dual  part  of  the  ctirvature  and  the  torsion  sep¬ 
arately.  Furthermore,  the  value  of  the  det  (R,,  R2,  R3,  Ro)  will 
vanish  due  to  the  existence  of  ^  term.  The  torsion  of  the  image 
curve  is  therefore  obtained  by  a  modified  method.  We  describe 
the  derivation  of  instantaneous  invariants  for  planar  motion 
as  follows: 

Consider  a  plane  M  in  continuous  motion  relative  to  a  fixed 
plane  F.  Let  us  choose,  arbitrarily,  two  coordinate  systems; 
9fll  (attached  to.M)  and  3"  (fixed  in  ¥),  as  shown  in  Fig.  3.  The 
transformation  matrix  of  SE  relative  to  is  given  by  the  matrix: 

—  Sv?  0  0  0  v  Op  —  Sv?  0 

A=  Sv?  '  0  -1-6  0  0  -u  Q?  0 

0  0  1  -V  u  0  0  0  1 


The  dual  matrix  A  defining  (b)  relative  to  {f}  is 


C0  -S0 
A=  S0  C0 
0  0 


-S6  0 
S§  C§  0 
0  0  1 


022= -€  5.21866  213  =  3-22836  +  6  15.25479 

023  =  0.3954-6  10,0621  133  =  0.37465-6  14.20382 


Op  — 

S(p  Op 

€(uS^  -  yC<p)  €(nC<p + vS^) 


-€U  (25) 

1 


C0C§-S0S§C6  -C0S§-S0C§C6  S0SS 
=  S0C§  +  C0S§C6  -S0S§  +  C0C§C5  -C0S6  (24) 

S§S5  G§SS  CS 


From  the  dual  matrix  A  in  Eq.  (24)  and  the  given  data,  we 
can  calculate  the  R„  (n  =  0,  1,  2,  3)  from  Eqs.  (9,  10).  Sub¬ 
stituting  the  vdues  of  R„  (n  =  0,  1,  2,  3)  into  Eqs.  (14-19), 
we  obtain  the  dual  instantaneous  invariants  of  the  coupler  link, 
up  to  the  third  order,  as  follows: 

021  =  1  +  6  4,58358  ^32=  1.83126  +  6  9.%317 


where  d  =  [u,  v,  0]”  is  the  position  vector  of  the  origin  of  311 
with  respect  to  3  and  ^  is  the  rotation  angle  of  311  relative  to 

We  may  express  d  as  a  function  of  the  angle  <p.  Consider  a 
planar  motion  given  as  functions  of  u(^)  and  v(^)  in  an  ar¬ 
bitrary  chosen  coordinate  system  up  to  the  fourth  order. 

u(^)  =  Uo  +  UiV?  +  ^  U2^^  +  “  U3^p^  +  :—  U49?'‘  +  0(v3^)  . 

z  o  24 

.  V(v?)  =  Vo  +  \i(p  +  ^  V2^^  +  ~  V3V?^  +  “  V4^^  +  0(^^) 

Z  6  24 

,  ^  dSr 

where  ^  V 


From  Eqs.  (4)  and  (25),  we  have  the  representation  of  the 
image  curve  as 
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R= 


X2 

X2 

L^4j 


r('*Cosf+vSm|) 

|e(vCosf-uSin0 


Sin  ^ 


Cos^ 

2 


(26) 


^  Using  Eqs,  (9,  10,  and  26),  we  can  determine  the  values  of 
Rn  (n  =  0,  1,  2,  3,  4)  at  the  reference  position.  Let  t;  =  t;  + 
eS,  «  =  ic  +  €S  and  f  =  V  +  e  r,  the  instantaneous  invariant 
b2  can  be  obtained  from  the  dual  part  of  the  curvature  k. 

From  the  Eq.  (8),  it  foUows  that 

[(RiAR2ARo)*(RiAR2ARo)]*^=  +  ^4^)1 

Now  substituting  the  values  of  R^,  at  the  reference  position 
in  the  above  equation,  we  have  ko  =  0  and 

But  =  2  ^  ^ 

:  a=4[(i;,  +  «2)^+(a|-i^)Y 

If  we  substitutethe  instantaneous  invariants  df  the  canonical 
system  (ao=gbb= ai = bj  =  a2 = 0).into  this  last  ©quation,  wehave 
the  formula  for  the  instantanwus  invariants  b2.  : 


b2 = V(vi + U2)^ + (Ui  -  V2)^ 


(27) 


From  Eq.  (26),  we  observe  that  ^1  and  are  pure  dual 
numbers.  The  ev^uation  of  det  (Ri,  R2,  R3,  Ro)  involve  only 
^e  term.  If  we  remove  the  dual  of  Xx  and  Xz^  det  (fti,  R2, 
R3,  Ro)  will  be  a  red  number  only.  The  real  part  of  torsion 
can  be  obtaiiied  from  the  e  ?  term  of  the  Uo^fo,  which  is  no2oTo, 
and  we  have 


To=- 


I'OKO 


(28) 


b3=—  (viVj+UiUa+UsVi-UiVj  +  VjVj  +  ujUj)  (32) 
l>2 

In  a  similar  fashion,  we  determine  the  instantaneous  invar¬ 
iants  a4  and  b4  by  evaluating  the  change  of  torsion  and  change 
of  change  of  curvature,  respectively.  We  obtain  the  explicit 
formulas  for  34  and  b4  as  follows: 

^4 = r-  [V1V2  +  U1U2  +  2(u3Vi  -  ui V3 + V2V3  +  U2U3)  -  U1U4 

02 

-ViV4-|-U4V2-U2V4]-2b3  (33) 

b4 [ul + vf  +  V1V3 + U1U3 + U3V2  -  U2V3  +  U4V1  -  Ui V4 

l>2 

'-r 


+  U2U4  +  V2V4  +  u|  +  \ij- 


+  b2  +  a3  4* 


(34) 


From  the  values  of  R«  that  we  have  already  determined,  we 
have 

(R  R  -  \  1 

=  “Yg  [i^+«i+4M2i;i“4ttit^-2t;iz;3 

+  3i^  -  2ttiU3  4-  3u2’^2u^V2  -  2^21^]  (29) 


.  Similarly,  we  substitute  the  instantaneous  invariants  of  the 
canonical  system  (ao=bo=ai  =  bi=a2=0)  into  Eqs.  (28)  and 
(29)  to  obtain.: : 

to=-3-2^  or  a,=  -|Tob2-|b2  (30) 

From  Eqs.  (28,  29,  and  30),  we  have  the  formula  for  the 
instantaneous  invariant  as. 

as  =  [vf  4-  Ui  +  4U2Vi  -  4UiV2  -  2ViV3  4-  3v2  -  2u,U3  4-  3U2 

202 

+  2U3V2-2U2V3]~|b2  (31) 

Using  the  same  method,  we  can  determine  the  instantaneous 
invariant  bs  by.  evaluating  the  change  of  curvature  and  we 
obtain  ici  =  €4b3  and 


Numerical  Example 

In  the  following  example  we  use  the  notations  u(0)  =  uf, 
v(0)  =  vj,  ^(0)  ~  <p2  etc.  Exmple:  Design  a  four-bar  linkage 
so  that  iht  input  crank  rotates  at  a  constant  angular 
velocity  of  2  rad/sec  (Clockwise)  while  the  output  crank  OcC 
rotates  wth  an  angular  velocity  of  5  rad/sec  (Counterclock¬ 
wise)  and  angular  accderatibn  of  21  rad/s^,  the  angular  ac¬ 
celeration  decreases  at  a  constant  rate  of  15  rad/s^  (This  example 
is  taken  from  Roth  and  Yang,  1977  and  Gupta,  1978).  For 
convenience  we  choose  the  fixed  frame  5  wiA  origia  at  O^,, 
and  x-axis  along  O^^Oc  and  the  mo^^ng  frame  tJH  with  origin 
at  Oc  and  x-axis  along  OcC  (see  Fig.  6  of  Roth  and  -Yang, 
1977).  The  motion  6f  link  OcC  relative  to  the  input  link-Oi3 
may  be  expressed  as 

u(t)=Cos(2t)=  1-4 1+16  ^+o(t^ 

v(t)  =  Siii(2t) = 2t  -  8  -  +  o(t’) 

6 

^  t3 

,,(t)=7t  +  21--15-+o(t’) 

From  the  above  expressions  we  obtain  the  values  of  time  de¬ 
rivatives  at  t  =  0: 

^^1=7 
-15 

^«4  =  0 

After  changing  the  parameter  t  to  we  obtain 

r 


II 

o 

II 

1 

11 

.*5- 

1^. 

II 

o 

c 

wn 

II 

o 

|vS--8 

[^U4*  =  16 

Ui  =  0 

U2=  -6.0816 
U3= 0.104956 
U4=  -0.2325 


^\i=0.2i51 
V2= -0.12245 
V3 =0.1466 
V4= -0.3309 


Substituting  the  above  values  into  Eqs.  (27),  (31),  and  (32), 
we  have  the  instantaneous  invariants 

b2=0.2380  a3= -0.4247  bj^ -0.1324 

34  =  0.5504  b4= -0.2373 

The  above  results  are  identical  to  those  given  in  Roth  and 
Yang  (1977)  and  Gupta  (1978). 


Conclusion 

We  have  derived  explicit  expressions  for  the  instantaneous 
invariants  of  rigid  body  motion  in  terms  of  arbitrary  rather 
than  the  canonical  coordinate  systems.  Equations  (14-22)  give 
the  expressions  of  the  instantaneous  invariants  for  spatial  and 
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spherical  motions  up  to  the  third  order.  Equations  (27  and  31- 
34)  give  the  expressions  of  instantaneous  invariants  for  planar 
motions  up  to  the  fourth  order.  A  few  examples  are  used  to 
illustrate  the  results.  These  expressions  are  more  explicit  than 
any  existing  methods  in  determining  the.  instantaneous  invar¬ 
iants  and  they  are  useful  in  design  and  analysis  of  mechanisms 
as  well  as  in  comparing  rigid  body  motions.  It  is  also  hoped 
that  these  expressions  would  facilitate  the  use  of  the  instan¬ 
taneous  invariants  in  more  practical  mechanism  design  and 
analysis  problems. 
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Nomenclature 

fofsl  torque  at  joint  k  (motor  torque) 
t^nia  torque  at  joint  k  due  to  the  dynamics  of  the 
robot  linkage 

torque  at  joint  k  due  to  the  dynamics  of  the 
load 

x^ai  torque  at  joint  k  due  to  the  dynamics  of  the 
mass  of  actuator  i 

m  number  of  discretized  points 
n  number  of  degrees  of  freedom  of  the  robot 
manipulator 
k  joint  number 

j  index  on  joint  number 

j  index  on  the  discretized  point 

Wj  weighting  factor 

Trffled  rated  torque  of  the  actuator 

inated  D.C.  motor  rated  current 

K  constant 

V  volume  of  the  actuator 

T  ((mxn)  x  1 )  total  joint  torque  vector 

Ta  ((rnxnLx  1)  joint  torque  vector  due  to  the 
dynamics  of  mass  of  the  actuators 

Tnia  ((mxn)  x  1)  joint  torque  vector  due  to  the 
linkage  dynamics 

T|  ((mxn)  X 1)  joint  torque  vector  due  to  the  load 
dynatnics 

Te  extra  torque;  ((mxn)  x  1)  vector  of  deviation 

of  the  portion  of  joint  torque  allocated  to  load 
Xi,  from  the  load  carrying  capacity,  "ioadnoai". 
resolved  into  joint  space  ^ 

Abstract 

A  procedure  for  sizing  the  joint  actuators  of  a  robot 
manipulator  in  the  design  stage  is  developed.  The  problem 
is  formulated  as  an  optimization  problem  where  the  actuators 


are  selected  to  achieve  a  desired  dynamic  load  carrying 
capacity  of  the  manipulator  over  an  entire  discretized 
trajectory.  Appropriate  constraints  on  this  optimization 
prevent  the  actuators  from  being  overloaded  and  limits  their 
sizes.  Using  worst  case  trajectories  within  the  workspace,  the 
problem  is  generalized  to  actuator  selection  over  the  entire 
workspace.  The  procedure  proposed  is  illustrated  by  the. 
choice  of  actuators  for  the  Carnegie  Mellon  University  (CMU) 
direct  drive  robot  manipulator. 

1  Introduction 

An  important  step  in  the  mechanical  design  of  robot 
manipulators  is  sizing  the  joint  actuators.  This  is  usually 
done  after  the  design  of  the  kinematic  configuration  of  the 
manipulator  as  well  as  stmctural  considerations.  Workspace 
considerations  and  solvability  of  kinematic  equations  are 
usually  used  to  choose  an  appropriate  kinematic 
configuration  and  structural  characteristics  of  a  manipulator 
are  selected  using  strength  and  stiffness  requirements.  Once 
these  two  steps  in  the  mechanical  design  of  a  manipulator 
are  completed,  then  the  joint  actuators  are  selected. 

Traditionally,  robot  actuator  sizing  have  been  based 
largely  on  use  of  simple  design  methods,  e.g.  they  are  sized 
to  meet  worst-case  or  average-case  gravity  and  acceleration 
torques  or  loads.  Such  procedures  are  too  simplistic  and 
usually  result  in  actuators  which  are  far  from  optimal.  A 
formalized  procedure  for  optimal  selection  of  robot  actuators 
was  developed  by  Vukobratovic,  Potkonjak  and  Datic  1984. 
Their  approach,  however,  minimized  the  energy  which  is  a 
non-task  oriented  objective  function.  In  addition,  their 
procedure  was  directed  toward  selection  of  hydraulic 
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actuators.  In  applications  such  as  material  handling  robots 
are  used  to  carry  a  payload  along  a  trajectory.  In  such  a 
situation,  the  robot  actuators  should  be  selected  based  on  a 
task  oriented  specification  such  as  a  desired  maximum 
payload.  Thomas,  Yuan-Chou  and  Tesar  1985  have 
considered  such  a  design  selection  strategy.  Their  approach, 
however,  sizes  the  actuators  only  locally  in  the  neighborhood 
of  a  configuration,  ideally,  the  actuators  should  be  sized  such 
that  the  robot  will  be  able  to  carry  the  maximum  desired 
payload  throughout  its  workspace.  This  is  the  problem 
considered  in  this  technical  note.  First,  selection  of  robot 
actuators  for  a  given  trajectory  and  a  maximum  desired 
payload  is  considered.  Then,  using  worst  case  trajectories  in 
the  workspace,  the  method  is  generalized  to  actuator  sizing 
for  a  manipulator  based  on  a  desired  dynamic  payload  and 
on  a  global  basis  (i-^-  over  the  entire  workspace). 

The  method  presented  uses  the  Dynamic  Load  Carrying 
Capacity  (DLCC)  of  a  robot  (see  Wang  and  Ravani  1988a)  to 
formulate  an  optimization  problem  for  the  selection  of  the  joint 
actuators.  All  joint  actuators  are  sized  simultaneously  and 
worst  case  trajectories  introduced  in  Wang  and  Ravani  1988b 
are  used  to  achieve  actuator  designs. that  can  dynamically 
carry  the  desired  payload  over  the  entire  workspace  of  the 
robot. 

2  Dynamic  Load  Carrying  Capacity  for  a  Given 

Trajectory 

The  dynamic  load  carrying  capacity  of  a  robot  manipulator 
depends  on  the  end-effector  trajectory  (position,  velocity  and 
acceleration),  and  can  be  defined  as  the  maximum  load  that 
the  manipulator  can  carry  in  executing  the  trajectory,  without 
exceeding  the  torque  limits  of  the  joint  actuators  (Wang  and 
Ravani  1988a). 

If  the  continuous  trajectory  of  the  end-effector,  is 
discretized  into  m  points  along  the  trajectory,  then  the  total 
torque/force  for  each  joint  at  every  grid  point  will  be  obtained 
by  linear  superposition  of  the  joint  torques/forces  due  to  the 
dynamics  of  the  robot  linkages  (excluding  the  masses  of  the 
actuators),  of  the  load,  and  the  dynamics  of  the  masses  of  the 
joint  actuators,  i.e. 

n 

{xk}j={Tkrt3}j+{xk,jj+  (1) 

i=k+1 

j=1,2,...,m. 

The  minimum  value  of  over  all  joints  (k),  and  entire 
discretized  trajectory  (j)  is  the  dynamic  load  carrying  capacity 
of  the  robot  for  that  trajectory. 


The  concept  of  DLCC  defined  by  Equation  (1)  will  be  used 
formulating  optimal  actuator  sizing  problem  in  Section  3. 

3  Optimal  Actuator  Sizing  formulation 
Since  the  most  popular  choice  of  the  actuators  for  medium 
and  small  size  manipulators  is  d.c.  motors  (either  permanent 
magnet  or  shunt  type),  we  limit  our  study  to  this  type  of 
actuators.  The  approach,  however,  can  easily  be  extended  to 
other  kinds  of  drives. 

Equation  (1),  gives  the  total  joint  torques  which  should  be 
supplied  by  the  joint  drives.  These  torques  are  bounded  by 
the  rated  torques  of  the  motors.  The  torque  that  actuator  can 
supply  for  carrying  the  load  at  each  grid  point,using  equation 
(DJs 

n 

{x^j={t^j-{xWj-  X{T^}i-  (2) 

i=k+1 

To  evaluate  x^ai,  the  mass  of  the  actuators  are  needed. 
Archer  and  Blenkinsop  1986,  have  developed  the 
relationship  between  the  volume  and  rated  torques  of  various 
electric  and  hydraulic  actuators.  The  general  form  of  this 
relationship  is 

log  Trated  =  « I09  V  +  p, 

where  a  and  p  are  the  coefficients  given  in  Archer  and 
Blenkinsop  1986.  This  last  relationship  will  be  used  for 
calculation  of  x’^ai-  The  mass  is  directly  related  to  volume  with 
electric-motor  density  of  3  to  4  gr/cm^,  (and  hydraulic  motor 
density  of  slightly  higher). 

D.C.  motors  in  control  applications,  e.g.  robotics,  are 
mostly  current-driven  (see  Keren  1 986).  In  such  a  case,  the 
actuator  can  be  idealized  as  a  torque  source.  The  output 
torque  of  the  actuator,Toutpuf .  should  be  limited  as 

•Tiated  =  *K  Irated  ^  Toutput  ^  K  Irated  =  Trated  •  (3) 

When  voltage-driven  d.c.  motors  are  used  for  the  joint 
actuators,  the  electrical  dynamics  of  the  unit  should  also  be 
considered  in  the  constraint  formulation. 

Equations  (2)  and  (3)  can  be  written  in  vector  form  over  all 
the  joints, 

ti=T-Xnia-i;a  (4) 

■Trated  ^  T  5  Trated  • 

We  consider  the  selection  of  joint  actuators  for  a  desired  load 
or  goal  carrying  capacity  (“loadgoai").  In  order  to  formulate  the 
objective  function,  we  define  the  extra  torque  vector,  Te.. 
This  torque  vector  is  defined  as  the  deviation  of  the  portion  of 
joint  torque  vector,  which  is  allocated  to  load  (Xi),  from  the 
vector  of  load  carrying  capacity,  "loadgoai".  resolved  into  joint 
space,  (Xgoal).  i.e. 
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Xe  =t  I  -  T  goal  =  (X  -  X  nia  •  X  a)  -  T  goal 
where  x  goal  is  a  vector  of  (m  x  n  )  x  1 ,  resulting  from  resolving 
the  desired  load  to  be  carried  ,  ("loadgoai")  into  joint  space, 
over  all  the  discretized  points  along  the  trajectory. 

The  optimization  problem  is  then  defined  as  the 
minimization  of  the  extra  torque  vector,  Xe,  i.e. 


4  Numerical  Example 

An  example  of  optimal  selection  of  robot  actuators,  based  on 
maximizing  the  dynamic  load  carrying  capacity  for  worst-case 
trajectory,  is  presented  for  a  robot  manipulator  with  the 
structure  of  the  Carnegie  Mellon  University  (CMU)  direct  drive 
arm  Asada  and  Kanade  1983,  as  shown  in  Rg.  1. 


min  (Xe)  =  min  (x  i  -  x  goal)  =  min  {(x  -  X  nia  -  X  a)  -  X  goat  }(5) 
constrained  by 

■Trated  5  X  S  Trated  •  (6) 

The  constraint  set  (6)  indicates  that  the  absolute  value  of 
each  Joint  torque  should  be  smaller  than  or  equal  to  the  rated 
torque  of  the  actuator  at  that  joint.  The  design  parameters  are 
X  and  Tfated-. 

The  vector  objective  function  (5),  is  converted  to  a  scalar 
objective  function  as  shown  in  Equation  (7),  using  sum  of  L2 
norms  of  weighted  extra  torque  vector,  Xe,  over  the  entire 
discretized  trajectory,  namely 


f  !  (  Wj[Xe  (i)]2) 


n 

S  (  Wi  [(Xi  -  xiinia  -  X%)  -  XUgoai  ]2) 


1/2 


Now  in  order  to  solve  the  optimization  problem  defined  by  the 
objective  function  (7)  and  the  set  of  inequality  constraints  (6), 
we  use  exterior  inequality  quadratic  penalty  function.  A 
computer  program  based  on  Quasi-Newton  method  is 
developed  to  solve  the  resulting  unconstrained  problem. 

3.1  Trajectory  Selection 

The  optimal  actuator  sizing  methodology  developed  in  this 
work,  depends  on  the  end-effector  trajectory.  In  sizing  the 
actuators,  it  is  desired  to  select  smallest  actuators  that  would 
allow  the  robot  to  carry  maximum  payload  in  a  desired  cycle 
time  throughoi^  its  entire  workspace.  This  means  that  the 
actuators  can  be  sized  based  on  the  worst-case  trajectories 
within  the  workspace.  For  each  worst-case  trajectory,  a 
corresponding  set  of  actuators  will  be  sized.  Amongst  these 
sets  of  actuators,  the  one  which  provides  the  maximum 
dynamic  load  carrying  capacity!?],  will  be  selected  as  the 
optimal  set  of  joint  actuators.  The  worst-case  trajectories 
defined  by  Wang  and  Ravani  [7],  Is  used  in  this  work. 


Fig.  1 .  A  three  joint  direct  drive  robot  (Reprint  from  [7]). 

The  initial  and  final  joint  configurations  of  the  robot  for  this 
example  are: 

[0°,0®.0°]Tand[0°,90°.90°]T 

A  cubic  polynomial  position  trajectory  is  used  between  these 
two  configurations.  The  two  end-points  correspond  to 
minimum  vertical  and  maximum  horizontal  positions  of  the 
end-effector.  The  maximum  horizontal  position  also  happens 
to  be  the  terminal  point  (see  Wang  and  Ravani  1988b).  This 
trajectory  satisfies  the  requirements  stated  in  Wang  and 
Ravani  1988b  for  the  worst-case  trajectory.  The  total  desired 
cycle  time  for  this  motion  is  1 .5  second.  This  time  period  Is 
based  on  average  speed  of  50  in/sec,  recommended  for 
industrial  robots  Asfahl  1985.  The  payload  (’’loadgoaf)  is 
taken  to  be  14  kg. 

In  order  to  guarantee  the  existence  of  a  minima,  rated 
torque  of  the  actuator  of  the  last  joint  has  been  selected 
apriori  to  be  40  Nm. 
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The  trajectory  is  discretized  into  10  equally  spaced  points. 
Joint  torques  due  to  link  dynamics,  x  nia-  sficl  due  to  carrying 
the  desired  load,  Xgoai.  are  evaluated  and  shown  in  Fig.'s  2 
and  3. 


,  a  II.  Joint#1 
»  Joir<«2 
_  Jairt«3 

Fig.  2.  Joint  torques  due  to  the  robot's  linkage  dynamics. 


_■ _  Joint  «1 

— « Jotfit«2 

^  -  Joint«3 

Fig.  3.  Joint  torques  due  to  dynamics  of  carrying  the 
desired  load. 

The  rated  torque  of  the  actuators  of  joints  1  and  2, 
optimized  over  the  entire  trajectory,  is  found  to  be  140  and  80 
Nm,  Moreover  optimal  rated  torques  of  the  first  and  . the 
second  joint  actuators  at  each  discrete  point  along  the 
trajectory  are  calculated  and  shown  in  Fig.  4. 


— —  TraTedJ 
■■  — » —  Traied_2 

Figure  4.  Optimal  actuators'  rated  torques  at  discrete 
points  along  the  trajectory. 

Comparison  of  Fig.  4  with  the  result  of  optimization  over 
the  entire  trajectory  (actuator  rated  torques  of  140  and  80 
Nm),  indicates  that  this  actuators  would  be  overloaded  at 
some  points  along  the  trajectory.  This  is  acceptable, 
considering  the  fact  that  actuators,  espedally  d.c.  motors,  are 
made  to  take  transient  overloading. 

If  the  transient  overloading  of  the  actuators  is  not 
desirable,  one  could  select  the  actuators  with  maximum 
Trated.  using  Fig.  4.  This  would  result  in  actuator  rated  torques 
of  270  and  130  Nm  and  keeps  the  rated  torque  of  the 
selected  actuators  always  higher  than  the  maximum  value  of 
total  joint  torques. 

The  original  actuators  of  joints  1  and  2  of  CMU  arm  have 
the  rated  torques  of  204  and  136  Nm. 

5  Conclusions 

A  method  is  presented  for  sizing  of  robot  joint  actuators 
based  on  meeting  a  functional  requirement  of  a  desired 
dynarhic  payioad  over  the  entire  robot  workspace.  The 
probleim  is  formulated  as  an  optimization  problem  minimizing 
the  sum  of  La  norms  of  weighted  extra  torque  vector  over  the 
entire  discretized  worst  case  trajectories:  The  constraints  are 
based  on  prevention  of  joint  actuators  from  excessive 
overloading. 

Actuators  are  selected  for  worst  case  trajectories  within 
the  workspace  resulting  in  manipulator  designs  that  can  meet 
a  desired  specification  of  dynamic  payload.  The  algorithm 
presented  provides  the  basis  for  rational  selection  of  joint 
actuators  in  mechanical  design  of  robot  manipulators. 


196 


6  Acknowledgement 

The  financial  support  of  this  work  by  US  Army  Research 
Office  grant  number  DAAL03-90-G-0005  is  gratefully 
acknowledged. 


7  REFERENCES 

1 .  Archer,  J.  R.,  and  Blenkinsop,  P.  T.,  1986,  "Actuation  for 
industrial  Robots",  Proc.  Instn.  Mech.  Engrs.  vol.  200  No. 


2.  Asada,  H.,  Kanade,  T.,  1 983,  "Control  of  Direct-Drive 
Arm",  ASME  Transsction,  J.  Dyn.  Sys.  Mess.  Contr.  vol. 
105,  Sept,  p.  136-142. 

3.  ^fahl,  C.  Ray.  1985,  Bobots  and  Manufacturing 
Automation  p.  1 70  ,  Wiley. 

4.  Gill,  P.  E,  Murray,  W.  and  Wright,  M..  H..  1 981 ,  "Practical 
ODtimlzation".Ac3cte/Ti/c  Press. 

5.  Keren,  Y.,  1986,  "Control  of  machine  tools  and  robots" 
AppI  Mech  Rev  Vol.  39,  No.  9,  Sept. 

6.  Wang,  L  T.,  and  Ravani.  B.,  1 988a,"Dynamic  Load 
Carrying  Capacity  of  Mechanical  Manipulators;  Part  I: 
Problem  Formulation",  ASME  Transaction.  J.  Dyn.  Sys. 
Meas.  Contr.  Vol.  110,  pp.  46-52. 

7.  Wang,  L  T.,  and  Ravani,  B.  1 988b."Dynamic  Load 
Carrying  Capacity  of  Mechanical  Manipulators;  Part  II: 
Numerical  Solution",  ASME  Transaction,  J.  Dyn.  Sys. 
Meas.  Contr.  110,  pp.  53-61. 


197 


DE-Vol.  26 


CAMS,  GEARS, 
ROBOT  AND 
MECHANISM  DESIGN 


presented  at 

THE  1990  ASME  DESIGN  TECHNICAL  CONFERENCES  - 
21st  BIENNIAL  MECHANISMS  CONFERENCE 
CHICAGO,  ILLINOIS 
SEPTEMBER  16-19,  1990 


sponsored  by 

THE  MECHANISMS  COMMITTEE  OF  THE 
DESIGN  ENGINEERING  DIVISION,  ASME 


edited  by 
A.  PISANO 

UNIVERSITY  OF  CAUFORNIA-BERKELEY 

M.  McCarthy 

UNIVERSITY  OF  CAUFORNIA-IRVINE 
S.  DERBY 

RENSSELAER  POLYTECHNIC  INSTITUTE 


THE  AMERICAN  SOCIETY  OF  MECHANICAL  ENGINEERS 
345  East  47th  Street  o  United  Engineering  Center  o  New  York,  N.Y.  10017 


M.  C.  Lin 
B.  Ravani 


Kinematics  of  the  Ball  Screw 
Mechanism 


S.  A.  Velinsky 

Department  of  Mechanical  and 
Aeronautical  Engineering, 
University  of  California- Davis, 
Davis,  CA  95616 


This  paper  studies  the  kinematics  of  the  Ball  Screw  Mechanism  {BSM)  with  the 
aim  of  developing  a  foundation  for  understanding  the  motion  of  the  balls  and  their 
contact  patterns  with  the  contacting  elements.  It  is  shown  that  there  is  always  slip 
between  the  balls  and  the  nut  or  screw,  and  therefore,  the  no-slip  condition  assumed 
in  the  BSM  literature  is  not  attainable.  The  effect  of  contact  deformation  on  the 
motion  of  the  balls  is  also  studied  and  is  used  to  develop  the  pattern  of  the  constant 
sliding  lines  of  contact  between  the  ball  and  the  screw  or  the  nut.  The  results  have 
applications  in  efficiency  analysis,  design,  wear  evaluation  and  finite  element  mod¬ 
eling  of  the  BSM. 


1  Introduction 

The  reciprocating  ball  screw  mechanism  is  a  force  and  mo¬ 
tion  transfer  device  belonging  to  the  family  of  power  trans¬ 
mission  screws  (Fig.  1).  Two  of  the  most  important  features 
of  the  mechanism  are  its  positional  accuracy  and  load  carrying 
capacity  making  it  suitable  as  the  drive  mechanism  for  robot 
manipulators  or  the  feed-drive  mechanism  of  machine  tools. 
The  utilization  of  bearing  balls  in  the  mechanism  replaces  the 
sliding  friction  of  the  conventional  power  screw  with  the  rolling 
friction  of  the  balls.  This  results  in  minimal  friction  during 
force  and  motion  transmission  and  eliminates  slipstick  with 
minimal  wear. 

This  paper  provicies  a  theoretical  study  of  the  kinematics  of 
the  ball  screw  mechanism.  It  derives  relationships  describing 
the  motion  of  the  ball  and  shows  that  slipping  takes  place 
between  the  ball  and  the  nut  (or  the  screw)  at  all  times.  This 
means  that  the  no-slip  condition  assumed  in  the  literature  (Levit, 
1963;  Drozdov,  1984)  is  unattainable.  The  proper  slip  condi¬ 
tions  are  derived  and  the  ball  motion  is  studied. 

In  addition,  the  effect  of  elastic  deformation  at  contact  areas 
between  the  ball  and  the  nut  (or  the  screw)  on  the  kinematics 


Fig.  1  A  bail  screw  and  nut  mechanism 
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of  the  mechanism  is  analyzed.  The  analysis  is  used  to  determine 
the  pattern  of  constant  sliding  lines  of  the  ball  in  the  contact 
area.  The  contact  line  pattern  is  useful  in  wear  and  finite 
element  analyses  of  the  ball  screw  mechanism.  The  work  pre¬ 
sented  provides  a  theoretical  framework  for  understanding  the 
motion  of  the  ball  in  the  BSM  and  sets  the  basis  for  efficiency 
and  design  analysis  of  the  mechanism.  The  application  of  the 
results  of  this  paper  to  efficiency  and  friction  analysis  of  the 
BSM  is  given  in  a  companion  paper  (Lin,  Velinsky,  and  Ravani, 
1994). 

In  all  the  analyses  presented  right-hand  screw  threads  and 
a  single  nut  are  assumed. 

2  Motion  of  the  Ball 

In  this  section,  we  study  the  motion  of  the  ball  by  affixing 
a  Frenet  coordinate  system  to  the  path  of  the  center  of  the 
ball.  This  will  enable  us  to  study  the  kinematics  of  the  ball 
motion  and  derive  the  slip  conditions. 

In  a  BSM,  the  center  of  the  ball  moves  along  the  helical 
groove  of  the  screw.  We  introduce  three  sets  of  coordinate 
systems.  The  first  (world)  coordinate  system,  ox'  y'  z  \  fixed 
with  its  z'  axis  coincident  with  the  axis  of  the  screw.  The  second 
(rotating)  coordinate  system,  oxyz,  also  has  its  z  axis  coincident 
with  the  screw  axis  (Fig.  2)  but  it  rotates  with  the  screw.  The 
third  coordinate  system  is  the  Frenet  frame  moving  with  the 
center  of  the  ball  along  the  trajectory  of  the  ball  center.  This 
trajectory,  with  respect  to  the  frame  oxyz,  is  a  circular  helical 
line  along  a  circular  cylindrical  surface  with  a  mean  radius 
(Fig.  2).  The  coordinate  transformation  between  the  first  two 
coordinate  systems  can  be  written  as 


X'=T,X 

where  X'  =  {i'  j’  k'Y, 

II 

C'n  -  0 

T,^ 

^0  0 

_  0  0  1 
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Fig.  2  The  position  of  the  ball  center,  O',  In  Cartesian  coordinates  and 
Frenet  coordinates 


Ys[tnb]^  and  12^ 


CaCe 

Sc. 


S$  Sa^d 

0 


(8) 


and  Cq  =  cos(Q),  =  sin(l]),  and  Q  denotes  the  angular 
displacement  between  the  two  coordinate  systems. 

Assume  that  a  ball  has  moved  through  an  angle  0k  along 
the  helical  groove  of  a  screw  with  lead  L.  The  position  vector 
of  the  ball  center,  R(0),  can  be  expressed  as 

TR(e)  =  R^X  (2) 

where  R’’  =  [r„C«  r„dta],  Q  =  cos(6),  S*  =  sm(d),  and 
the  helix  angle,  a,  is  defined  as  ta  =  tan(a)  =  L/lirr^^  The 
superscripts  “r”  and  **  are  used  here  to  distinguish  a  vector 

with  respect  to  the  rotational  and  the  world  coordinate  systems, 
respectively. 

By  substituting  Eq.  (1)  into  Eq.  (2),  the  position  of  the  ball 
center  can  be  expressed  with  respect  to  the  world  coordinate 
system  as 

’^R  =  R^Tr^X'.  (3) 

By  definition  (Kreyszig,  1983),  the  triad  of  unit  vectors  de¬ 
scribing  the  Frenet  Coordinate  system  of  the  ball  center  with 
respect  to  the  rotating  Cartesian  system,  oxyz,  can  be  expressed 
as  follows  (Fig.  2): 

(fl)  Unit  tangent  vector 

t^l-'kdSe  kdCe  rdJX  (4) 

(b)  Unit  normal  vector 

n=[-Q  -Se  0]X  (5) 

(c)  Unit  binormal  vector 

h  =  lrdS0  -TdCe  kd\\  (6) 


where 


curvature 


The  terms  and  Ca  denote  sin(a)  and  cos(a),  respectively. 
Now,  Eq.  (2)  can  be  rewritten  in  terms  of  the  Frenet  frame  of 
the  ball  center  as 

=  R^TjY  =  r„[SjJ  - 1  (9) 

It  will  be  shown  in  the  next  section,  that  the  ball  can  only 
move  relative  to  the  screw  in  the  tangential  direction  of  the 
Frenet  frame  of  the  ball  center  trajectory.  This  is  because  the 
ball  is  confined  along  the  helical  groove  in  directions  parallel 
to  the  normal  plane  of  the  trajectory  of  the  ball  center.  Phys¬ 
ically,  this  means  that  the  contact  points  between  the  ball  and 
the  screw,  as  well  as  between  the  ball  and  the  nut,  must  be 
located  on  this  normal  plane. 

In  order  to  locate  these  contact  points,  the  contact  angle, 
j3,  is  defined  as  the  angle  between  the  unit  normal  vector  and 
the  contact  veaor.  The  contact  vector  is  oriented  from  the  ball 
center  toward  the  contact  point,  as  shown  in  Fig.  3.  Points  A 
and  B  represent  the  instantaneous  contact  points  between  the 
ball  and  the  nut  and  between  the  ball  and  the  screw,  respec¬ 
tively.  Note  that  (FiS-  3)  is  considered  positive  when  meas¬ 
ured  clockwise  from  the  negative  side  of  the  normal  axis; 
whereas  fis  (Fig-  3)  is  considered  positive  when  measured  clock¬ 
wise  from  the  positive  side  of  the  normal  axis.  The  angle 
and  are  always  positive  for  a  counterclockwise  rotation  of 
the  screw  (viewed  from  the  end)  and  negative  for  a  clockwise 
screw  rotation. 

We  now  introduce  a  pair  of  new  coordinate  systems  iX,  YiZi , 
with  /  =  A,  B,  between  the  ball  and  the  raceway  such  that  the 
Xi  Yi  plane  lies  on  the  plane  of  contact  and  the  Zr^s  lies 
along  the  common  normal  of  the  two  contacting  bodies  (Fig. 
3).  These  coordinate  systems  are  used  to  describe  the  position 
of  the  contact  point  ^tween  the  two  contacting  bodies.  We 
also  assume  point  contacts  along  a  diagonal  line  between  the 
ball  and  the  screw  and  nut.  The  coordinate  transformation 
between  the  Frenet  frame  of  the  ball  center  trajectory  and  the 
iXi  YiZi  coordinate  system  is 

X/^TjY  (10) 

where 


and  torsion 


li 

Co 

R 

(7) 

X,=  [i/ 

j,  k,]^  and  T3  = 

0 

1 

0 

Q,- 

0 

*m 

0 

Q/ 

Sei 

The  coordinate  transformation  between  the  Frenet  frame  of 

the  ball  center  and  the  rotational  Coordinate  system  oxyz  can  The  position  vector  of  the  contact  point  B  with  respect  to 
be  expressed  as  the  ball  center  can  thus  be  expressed  as 
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trajectory  ot 


where 


an  arc  on  the  base 
dfcJe  of  the  cyfinder 

Fig.  4  Phase  angle  between  two  consecutive  balls 


By  solving  the  above  equation  for  the  phase  angle,  0,  the 
maximum  number  of  balls  per  revolution,  N,  can  be  obtained: 


27r 
0  ’ 


(16) 

(17) 


Furthermore,  Eq.  (14)  can  be  expressed  as 
6R/  =  2ri,n/. 

Similarly, 

6Ry=2r^ny.  (18) 

In  these  equations,  n/  and  ny  are  unit  vectors  between  two 
consecutive  ball  centers.  Using  the  Frenet  frame,  the  relative 
motion  between  the  centers  of  the  balls  in  contact  with  each 
other  can  be  written  as: 

■  clc^.+si 


Y(.e +<!>)= 


CaS^  Ca.iStt(l  —  C^) 

ScS^  sic^+cl 


W)- 


(19) 

The  above  equation  is  valid  for  balls  which  are  both  ahead 
(positive  values)  and  behind  (negative  (t>  values). 


b 


Fig.  5  Slip  velocities  at  steady  state  with  no-slip  along  the  tangential 
direction,  for  the  conversion  of  rotary  into  linear  motion 


Rao'  =  [0  0  r,]Xs  (11) 

where  denote  the  ball  radius.  The  position  vectors  of  contact 
points  A  and  B  with  respect  to  the  rotational  coordinate  system 
can  be  expressed  as 

Tt^-Tt  +  R^o'  (12) 

and 

'Ra='R  +  Rao'-  (13) 

There  is  only  one  independent  variable,  6,  which  describes 
the  relative  position  of  the  center  of  the  ball  with  respect  to 
the  screw  in  the  equation  of  the  circular  helical  line  [Eq.  (2)]. 
The  relative  position  of  the  two  consecutive  ball  centers  can 
therefore  be  expressed  as  a  function  of  a  phase  angle,  0,  shown 
in  Fig.  4,  as 

6R;='R(^  +  0)~'R(0) 

=  1-Q  S«(0~5^)]Y.  (14) 

Furthermore,  the  minimal  central  distance  between  two  con¬ 
secutive  ball  centers  must  be  equal  to  the  diameter,  2rf„  of  the 
balls;  that  is 

\/8Rj^dRf  =  2ri, 
or 

(0U^  +  2(1-Q)=4c7^  (15) 


3  Slip  Analysis 

Determination  of  the  slip  conditions  between  the  recipro¬ 
cating  balls  and  the  nut  or  the  screw  is  important  in  under¬ 
standing  the  motion  of  the  ball  in  the  ball  screw  mechanism. 
These  conditions  are  also  necessary  for  efficiency  analysis  and 
the  design  considerations  (see  Lin,  Velinsky,  and  Ravani,  1994). 
A  complete  velocity  analysis  is  necessary  for  determining  slip 
directions  and  velocities.  This  section  provides  such  an  analysis 
and  applies  the  results  to  a  characterization  of  the  slip  con¬ 
ditions  for  the  BSM.  Previous  works  (see  Levit,  1963;  and 
Drozdov,  1984)  have  treated  the  kinematics  of  the  BSM  using 
the  previous  results  from  ball  bearings  (see,  e.g.,  Harris,  1971; 
and  Jones,  1959).  This  has  resulted  in  incorrect  results  for  the 
BSM  since  the  angular  velocities  of  different  elements,  namely 
the  ball,  the  screw,  and  the  nut  are  not  additive  as  used  in  ball 
bearing  analysis. 

The  velocity  of  the  ball  center  with  respect  to  the  rotational 
Cartesian  Coordinates,  oxyz,  can  be  obtained  by  differentiat¬ 
ing  Eq.  (9)  with  respect  to  time;  i.e., 

=  0  0]Y.  (20) 

Note  that  in  the  above  equation,  the  velocity  of  the  ball 
center  relative  to  the  rotating  coordinate  system  has  only  a 
tangential  component.  Physically,  the  ball  cannot  move  in  the 
normal  plane  since  contacting  surfaces  would  have  to  separate 
or  crush  together  for  motion  in  the  normal  plane  to  exist. 
Furthermore,  the  radius  of  motion  of  the  ball  center  is  d,  which 
includes  both  the  curvature  and  the  torsion  of  the  helix. 

The  velocity  of  the  ball  center  with  respect  to  the  world 
coordinate  system,  ox' y'  z',  can  be  obtained  by  differentiat¬ 
ing  Eq.  (3)  with  respect  to  time;  i.e., 

’^R=[rf0+r„C«Q  0  -r„S„Q]Y.  (21) 

The  above  equation  can  also  be  obtained  from  ^R  =  ''R  + 
Ox^R. 

If  we  let  oj  =  [co/  o)„  oj^]  Y  be  the  angular  velocity  of  the 
ball,  then  the  instantaneous  velocity  of  the  two  points  A  and 
B  (namely  Va^  and  F^^)  on  the  ball  (which  are  coincident  with 
the  two  contact  points  between  the  nut  and  the  screw,  respec¬ 
tively)  can  be  expressed  as 
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and 


dB  CffA  -  <^n  S^A ) 

'* bOii  S^A 

^m  *^of^  “  ^b 


(22) 


and 


Vsi,=  ^R  +  ajxR 


BO 


dB  'VrmCa^-^ri 

~’r^a),S/35 


-^T 


(23) 


We  will  now  determine  the  velocity  of  the  point  on  the  nut 
which  is  coincident  with  the  contact  point  for  the  case  when 
the  screw  is  driving.  If  the  nut  is  driving,  similar  results  can 
easily  be  obtained.  We  will  only  consider  the  conversion  of 
rotary  into  linear  motion.  The  cases  involving  the  conversion 
of  linear  into  rotary  motion  are  merely  the  kinematic  inversions 
of  the  cases  presented  here. 

As  in  the  conventional  power  screw  unit,  if  the  nut  is  re¬ 
strained  from  rotating,  it  will  move  axially  a  distance  QL/2x 
along  the  screw  for  a  screw  rotation  of  angle  Q.  Hence,  if  the 
angular  velocity  of  the  screw  is  flk,  then  the  velocity  of  any 
point  on  the  nut  with  respect  to  the  world  coordinate  system, 
ojc  'y  'z  '  can  be  represented  as 

0  1]Y. 

(24) 

The  velocity  of  the  contact  point  on  the  screw  can  similarly 
be  obtained  as  follows. 

If  the  screw  rotates  with  an  angular  velocity  Qk,  the  velocity 
of  point  B  on  the  screw  (namely  Vbs)  coincident  with  the 
contact  point  will  be 


dB  — “  0)^  S|Sb] 

“rf,(w/-QSa)5^B 
rb((jJt—^Sa)  C0B 

rb{o)i-6Sa) 

dB  -r^  [(oj* — flCa)  Cbb  ”  Sbb] 
0 


IT 


^VsBnsB  (27) 

where  nsA  and  n$B  denote  the  unit  vectors  in  the  direction  of 
the  slip  velocities  at  the  instantaneous  contact  points  A  and 
B,  respectively,  and  and  \sb  are  the  magnitudes  of  these 
velocities  (Fig.  5). 

Physically,  contacting  surfaces  should  have  common  nor¬ 
mals  at  their  instantaneous  points  of  contact.  This  means  that 
the  slip  velocities  should  be  perpendicular  to  the  common  nor¬ 
mal  to  the  two  surfaces,  namely  Rao*  =  0  and 
=  0.  These  conditions  on  slip  velocities  represent  the  physical 
contact  requirement  for  the  contacting  surfaces  not  to  separate 
or  crush  together.  We  should  note  that  the  unit  vectors  nsA 
and  nsB  are  opposite  in  direction  to  the  frictional  forces  on 
the  ball  at  the  contact  points. 

The  location  of  all  contact  points  can  be  determined  using 
these  physical  contact  conditions.  Suppose  that  the  contact 
points  are  not  located  in  the  central  normal  plane.  Since  the 
balls  are  spherical,  only  two  angles  are  necessary  to  define  the 
location  of  a  contact  point  on  a  ball.  Suppose  these  two  angles 
are  Pb  and  an  angle  ^  off  the  central  normal  plane.  Consider 
point  B  as  an  example  to  simplify  the  analysis.  The  position 
vector  Rfio'  can  be  determined  from  geometry  as 

Substituting  this  last  equation  into  Eqs.  (11),  (13),  (23),  and 
(25),  we  can  obtain  V5B,  which  when  substituted  into  the  con¬ 
tact  conditions  result  in: 


VBs=«kx"RB 


(^m 

~TbSBB^a 


T 


Y. 


(25) 


This  velocity  is  with  respect  to  the  world  coordinate  system, 
ox'y'z'. 

The  slip  velocities  at  contact  points  A  and  B  (  Vsa  and  Vsbj 
respectively)  can  now  be  determined  as  follows. 

From  Eqs.  (22),  (23),  (24),  and  (25),  the  slip  velocities  at 
points  A  and  B  are,  respectively: 


ysA^^Ab-yAn 


d{6  +fi)  -aj;,S/5^) 

-rffUfCBA 

d(B  ■¥Q)-\-rB{(j3bCBA-<^nSBA) 
0 


X, 


=  ^SA^SA 


(26) 


Rbo^  ^ysB^  '^T^ddS^  —  O. 

Since  rb  d  cannot  be  zero  and  B  is  not  a  geometric  parameter, 
the  offset  angle,  has  to  be  zero  to  satisfy  the  contact  con¬ 
dition.  In  other  words,  all  the  contact  points  must  lie  on  the 
normal  plane. 

Let  us  decompose  the  slip-velocity  into  two  parts:  (/)  on  the 
central  normal  plane,  and  (2)  along  the  tangential  direction  of 
the  Frenet  frame  of  the  ball  center  trajectory.  Then,  as  we  can 
see  from  the  Eqs.  (26)  and  (27),  the  magnitudes  of  the  resultant 
slip-velocities  on  the  centr^  normal  plane  at  the  ball/nut  and 
the  ball/screw  contact  points  are  r*(co,~SaO)  and  re¬ 
spectively.  Accordingly,  no  co,  value  exists  which  causes  the 
slip-velocities  on  the  central  normal  plane  at  both  contact  points 
to  vanish  simultaneously.  In  other  words,  friction  can  never 
vanish  at  both  contact  points  on  the  central  normal  plane  unless 
the  helix  angle,  a,  equals  zero.  A  method  for  calculating  the 
frictional  losses,  which  has  been  used  as  the  basis  for  many 
subsequent  studies,  was  given  by  Levit  (1963)  who  defined  the 
BSM  friction  angle  to  lie  along  the  tangential  direction  of  the 
Frenet  frame  of  the  ball  center  trajectory.  This  produces  an 
inaccuracy  due  to  the  fact  that  it  does  not  account  for  the 
effects  of  the  torsion  of  the  helix. 

4  Acceleration  Analysis 

The  acceleration  of  the  ball  center  with  respect  to  the  ro¬ 
tational  Cartesian  coordinate  system,  oxyzj  can  be  expressed 
as 
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'■R=[rfe  rj^  0]Y  (28) 

where  dd  and  r„6^  represent  components  of  tangential  and 
centripetal  acceleration  of  the  ball  center  with  respect  to  the 
rotational  Coordinate  System  oxyz,  respectively.  The  accel¬ 
eration  of  the  ball  center  in  the  world  coordinate  system  is 
expressed  as 

'*'R  =  '^R  +  Qx''R  +  Qxfix''R  +  2Qx'’R 


=  [cf(0  +  C^Q)  /-^Ce+Q)^  -r„S„Q]Y  (29) 

where  the  component  along  the  normal  direction,  r„{6  +  ilf 
includes  both  the  centripetal  and  Coriolis  effects.  The  angular 
acceleration  of  the  ball  is  given  by  the  symbol  a: 

17- 


a  = 


Ccc{0  +  Q)  -i-Scj(^4*Q) 
ci„+  (cu,Ctt~coi,Sa)(^  +0) 
0)^ + ojfl + ^  d ) + Ccc  (^ + d ) 


(30) 


mation  between  the  Frenet  frame  and  the  iX,  T/Zy  system  is 
Xy^TjY  (31) 


where  the  first  term  in  each  component  represents  the  change 
in  magnitude  and  the  second  term  represents  the  gyroscopic 
motion. 

5  The  Kinematics  of  the  BSM  with  Elastic  Deformation 

Loads  acting  between  the  balls  and  raceways  in  the  BSM 
develop  only  small  areas  of  contact  between  the  mating  mem¬ 
bers.  Consequently,  although  the  elemental  loading  may  only 
be  moderate,  stresses  induced  on  the  surfaces  of  the  balls  and 
raceways  are  usuaUy  large.  Contact  deformations  are  caused 
by  contact  stresses.  Because  of  the  rigid  nature  of  the  balls, 
these  deformations  are  generally  of  a  low  order  of  magnitude. 
The  classical  solution  for  the  local  stress  and  deformation  of 
two  elastic  bodies  contacting  at  a  single  point  is  that  of  Hertz 
(1881)  which  has  been  applied  to  bail-bearing  problems.  Levit 
(1963)  introduced  the  theory  to  the  BSM.  However,  the  kin¬ 
ematics  of  the  BSM  regarding  elastic  deformations  has  never 
been  solved  completely.  It  is  the  purpose  of  this  section  to 
develop  the  internal  motions  of  the  BSM,  the  relative  slip- 
velocity  between  the  ball  and  the  nut/screw  and  the  pattern 
of  sliding  lines  of  contact  and  thus  set  the  foundation  for 
further  investigations  on  friaion,  wear  and  finite  element  anal¬ 
yses. 


5.1  Position  of  the  Omtact  Point.  To  specify  the  position 
of  a  contact  point  between  two  deformed  bodies,  a  coordinate 
system,  /AT/  Y/Z/,i  =  A,  B,”  is  introduced  between  the  ball  and 
the  raceway  (at  each  contact  point  as  before).  In  this  coordinate 
system,  the  Xi  Yi  plane  lies  on  the  plane  of  contact  and  the  Zr 
axis  is  the  common  normal  of  the  two  contacting  bodies.  Figure 
6  shows  a  ball  contacting  the  screw  raceway  such  that  the 


where 


x,  =  [/,  Ji  kf  and 


“  Cfii 

0  0 

Cfii  Sffi 


The  position  vector  of  an  arbitrary  point  Q,  on  the  Xb  Yb 
plane  with  respect  to  the  origin  B,  can  be  expressed  as  Rqb  = 
[xb  Yb  0]  Xb  and  the  position  vector  of  point  B  with  respect 
to  the  ball  center  is  Rbo'  =  [0  0  r*]Xa.  The  radius  of 
curvature  of  the  screw  raceway  groove  is 


rs-yj  rl-a\ ,  (32) 


and  the  radius  of  curvature  of  the  deformed  surface  as  defined 
by  Hertz  is 


Ri^ 


Ml 


where 


(33) 


fi 


i=A,  B. 
fb 


Therefore,  the  position  vector  of  point  Q  with  respect  to  the 
world  coordinates  is 

Reo'=Rcfl+Rao'  +  "'R.  (34) 


5.2  Velocity  of  BSM  with  Deformation.  The  derivation 
of  the  velocity  of  the  BSM  with  deformation  is  similar  to  the 
procedure  in  Section  3,  from  which,  the  following  equations 
can  be  determined: 

5.2.  i  Velocity  of  Any  Contact  Point  P/Q  on  the  Ball 


Vp4  =  '^R  +  coxR«,' 


d{d  +CaQ)  +  (XaSba+I’A  CBA)<^b+  CBA~rASBA)<^n 

—  (Xa  — Sg^)«,+x<  CO* 

-  r„  S„Q  -  (Xa  SgA  +  fA  CfsA  )u,-yA  Oln 


T 

Y 


R-l-a>xR^^ 

did  +clfl)  {Xb  SffB  ~^B  +  (-^B  ^35) 

=  —  {XsCBB'^rBS^Bl^t'^yB^^b 

—  {Xb  Sbb  +  7-0  C35) COy  -I- yB  03„ 


T 

Y 


(35) 


(36) 


normal  force  between  the  ball  and  the  screw  is  distributed  over 
an  elliptical  surface  defined  by  the  projected  major  and  minor 
semi-axes,  Cb  and  respectively.  The  coordinate  transfor- 


5.2.2  Velocity  of  Any  Contact  Point  P  on  the  Nut 

0  1]Y  (37) 
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5.2,3  Velocity  of  Any  Contact  Point  Q  on  the  Screw 
Vg5=  OK  X  Rqo 


—  CffB  +  ^Bb)  C’aO 
yeCa^-  (rBSBB-^XBCBB)Sa^ 
^  ^^bCbB’^XbSbb) 


T 


Y 


(38) 


Fig.  7  Slagnation  line,  line  of  no-slip  along  the  common  normal  of  the 
contact  surfaces 


spinning  velocity  on  the  normal  plane  is  along  the  common 
normal  of  the  contact  surfaces.  In  this  case,  the  common  nor- 
5.3  Slip  Velocities  at  Contact  Points  P  and  Q  has  been  referred  to  as  the  “spinning  axis”  in  the  ball- 


Vs;>  =  Vp,^Vp, 

d{d  {x^  Sb^  +  C0yi)o)B  +  {^A  QS/4  ~^A  Sba)^/! 

■“  {Xa  Sba  +  Cba  )<^t‘^yA^n 


^yAi  ^BA  +  ^0A  c^rt)  “  Ol 

Xa^Sba  +^(^'  +  0)4-r^(Cj3^O)*-S/3>iOJ„) 
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Xa 


“ O),  +  7^  ( Qa  <^b “  ^0A  ^n) 
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Vso  =  Vq^-V^s 


do  +  (xbSbb- ^bCbb)  Ca^)  ■+*  + 

{Xb  Cbb^  ^B^Bb)  (^a^  “^/)  ^Ot^) 

(Xj  S55  —  C/ja)  ( S^d c*>/)  —  >^5  a)„ 

”rj5(Scrd  — OJ^) 

( <^b  “  Ccfi  )  +  ^BB  ^rt]  +ci6  —  rslCffB (^b~  ^ad  )  ”  S^b 

XB(5aCi“a)/)  +J'5[C/SB(a)i,-Cad) 


Xb 


.(40) 


Note  that  the  equations  derived  in  this  section  can  be  reduced 
to  the  corresponding  equations  in  Section  3,  without  consid¬ 
ering  elastic  deformations,  where  n  =  r*  and  X/  =  yi  =  0. 

5,4  Slip  Velocity  and  Pattern  of  Constant  Sliding 
Lines.  The  pattern  of  constant  sliding  lines  of  ball  bearings 
in  the  elliptical  contact  area  was  investigated  by  Lundberg 
(1954)  and  presented  by  Harris  (1984)  in  his  book.  Here  a 
pattern  of  sliding  lines  are  derived  for  the  BSM  directly  from 
the  equations  of  slip-velocity  obtained  from  the  previous  sec¬ 
tion.  Since  the  pattern  of  constant  sliding  lines  are  similar  for 
different  types  of  motions,  only  the  case  with  screw  driving 
and  conversion  of  rotary  into  linear  motion  is  shown  as  an 
example.  In  accordance  with  the  Hertzian  radius  of  contact  in 
the  direction  transverse  to  the  motion,  the  contact  surface  has 
a  harmonic  mean  profile  radius.  This  implies  that  the  contact 
surface  is  not  straight  but  generally  curved.  The  no-slip  con¬ 
dition  along  the  common  normal  of  the  contact  surfaces  be¬ 
tween  the  ball  and  the  nut  is  obtained  from  the  Z^-component 
of  Eq.  (39)  as  follows: 

—  Xa  Wr  +  (^*>^  Cba  ~~  ^BA  )  =  0-  (41) 

This  equation  represents  a  straight  line,  referred  to  as  the 
stagnation  line,  on  the  Xa  Ya  plane.  This  line  passes  through 
the  origin  A,  with  a  slip  of  ci),/(o)i>CB/i -03n5/5^)»  as  shown  in 
Fig.  7. 

We  first  assume  that  the  nut  is  fixed  in  space.  Thus,  it  tends 
to  “cut-in”  the  contact  surface  for  every  contact  point  on  the 
ball  lying  on  one  side  of  the  stagnation  line  in  the  contact 
ellipse  and  “leave-from”  the  contact  surface  for  every  point 
lying  on  the  opposite  side  of  the  line.  There  is  a  special  case, 
in  which  Cqa  S^a  =  0,  which  indicates  that  the  resultant 


bearing  literature.  Additionally,  the  “stagnation  line”  becomes 
the  Tyi-axis  (x^  =  0)  which  is  true  because  pure  spin  will  not 
change  the  depth  of  “cut-in”  of  a  certain  point.  Similarly,  if 
0),  =  0,  the  “stagnation  line”  becomes  the  A^-axis  (y^  =  0) 
because  the  rolling  motion  changes  the  depth  of  “cut-in.” 

The  equation  of  constant  sliding  lines  can  now  be  obtained 
from  the  Xa-  and  T^-components  of  Eq.  (39);  that  is 

(Xa -PAf+  (yA  -qA?  =  C  (42) 

where 

_  d{6  ■h-tl)’¥rA(<j3b  ^BA  "  <*>n  ^BA  ) 

<^b  ^BA  +  Cqa 

_ _ 

and 

^ _ constant 

(<^b  ^BA  +  Cba)^ 

Equation  (42)  represents  a  family  of  circles  on  the  Xa  Ya- 
plane  with  center  located  at  (p^,  Qa  ),  where  the  no-slip  con¬ 
dition  on  the  Xa  y^-plane  occurs,  as  shown  in  Fig.  8.  It  is 
interesting  that  the  point,  (pa^Qa)^  will  not  lie  on  the  stagnation 
line  [Eq.  (41)]  unless  the  term,  0  +  Cl,  vanishes  which  is 
impossible  except  for  the  static  situation.  In  other  words,  a 
point  with  no-slip  is  not  possible  within  the  contact  ellipse. 
Note  that  can  also  be  defined  as  a  variable  from  Hertzian 
contact  theory  and  Eq.  (42)  remains  valid.  In  such  a  case,  Eq. 
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Fig.  8  Pattern  of  constant  sliding  lines  at  constant  surfaces 


(42)  results  in  a  family  of  non-coplanar  circles  representative 
of  the  true  contact  surface  shape. 

As  one  can  see  from  Eq.  (42),  under  the  special  case  where 

+  <^n  CpA  =0,  (43 ) 

the  slip-velocity  on  the  Xa  J^-plane  is  an  invariant  which  is 
exactly  the  same  as  the  slip-velocity  at  point  A  without  elastic 
deformation  if  The  line  defined  by  Eq.  (43),  which  is 

a  line  on  the  normal  plane  perpendicular  to  the  common  normal 
of  the  contact  surfaces,  has  been  called  the  “rolling  axis”  in 
the  case  of  ball-bearings.  It  is  obvious  from  Eq.  (42)  that  the 
no-slip  condition  is  not  possible  even  when  spinning  is  absent. 

A  similar  analysis  can  be  applied  to  the  contact  area  between 
the  ball  and  the  screw,  and  the  corresponding  resultant  equa¬ 
tions  are: 

Equation  of  stagnation  line: 

XsiSa^  -  {oib  -  C«li )  C^b  ~  S^b]  =  0  (44) 

Equation  of  constant  sliding  lines: 

(XB-PBf+{yB-QBf  =  C  (45) 

where 

de-rB[(et,-Cjl)CBB-<^„SeB\ 

{q3b  ”  )  Sbb  +  CfiB 

r5(5^d-aj,) 

= - ^ 

(  OJ*  -  )  Sbb  +  <^n  Cbb 

and 

^ _ constant _ 

[(o>a — Cad )  Sbb + Wrt  Cb^ 

Equation  of  “pure  rolling”: 

“  C^d )  Sbb + Wrt  Cbb  =  0  (46) 


Equation  of  “pure  spinning”: 

( ^6  ”  )  Cbb  -  Sbb  =  0  (47) 

Note  that  the  corresponding  equations  at  the  ball-nui  contact 
area  are  independent  of  the  helix  angle  of  the  BSM,  whereas 
those  at  the  ball-screw  contact  area  are  dependent  on  the  helix 
angle. 

Conclusions 

In  this  paper,  we  have  studied  the  kinematics  of  the  ball 
screw  mechanism  with  the  aim  of  understanding  the  slip  con¬ 
ditions  and  pattern  of  contact  points  between  the  elements. 
We  have  derived  the  general  slip  conditions  and  have  shown 
that  the  condition  of  no-slip  in  the  central  normal  plane  as¬ 
sumed  in  the  previous  literature  is  theoretically  unattainable. 
The  effect  of  contact  deformation  on  the  motion  of  the  balls 
was  also  studied  and  used  to  determine  the  pattern  of  sliding 
lines  of  the  balls  in  contact  areas. 

The  results,  in  addition  to  their  theoretical  interest,  provide 
the  basis  for  efficiency  analysis,  design,  wear  analysis,  and 
finite  element  modeling  of  the  ball  screw  mechanism. 
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Design  of  the  Ball  Screw 
Mechanism  for  Optimal  Efficiency 

This  paper  develops  theories  for  evaluating  the  efficiency  of  the  ball  screw  mechanism 
and  additionally,  for  designing  this  mechanism.  Initially,  a  quasi-static  analysis, 
which  is  similar  to  that  of  the  early  work  in  this  area,  is  employed  to  evaluate 
efficiency.  Dynamic  forces,  which  are  neglected  by  the  quasi-static  analysis,  will 
have  an  effect  on  efficiency.  Thus,  an  exact  theory  based  on  the  simultaneous  solution 
of  both  the  Newton-Euler  equations  of  motion  and  the  relevant  kinematic  equations 
is  employed  to  determine  mechanism  efficiency,  as  well  as  the  steady-state  motion 
of  all  components  within  theballscrew.  However,  the  development  of  design  methods 
based  on  this  exact  theory  is  difficult  due  to  the  extensive  computation  necessary 
and  thus,  an  approximate  closed-form  representation,  that  still  accounts  for  the  ball 
screw  dynamics,  is  derived.  The  validity  of  this  closed-form  solution  is  proven  and 
it  is  then  used  in  developing  an  optimum  design  methodology  for  the  ball  screw 
mechanism  based  on  efficiency.  Additionally,  the  self-braking  condition  is  examined, 
as  are  load  capacity  considerations. 


Introduction 

The  ball  screw  mechanism  (BSM)  has  been  used  for  many 
years  in  a  wide  variety  of  applications.  The  most  referred  to 
work  on  the  BSM  is  due  to  Levit  (1963a,  1963b).  In  his  work, 
Levit  has  both  reviewed  the  literature  prior  to  his,  as  well  as 
providing  a  series  of  calculations  for  designing  this  mechanism. 
Levit’s  work  has  provided  the  foundation  for  most  of  the 
subsequent  work  on  the  design  and  manufacture  of  this  mech¬ 
anism  including  that  of  Belyaev  and  associates  (1971,  1973, 
1974a,  1974b,  1981,  1983),  Drozdov  (1984)  and  Mukhortov 
(1982),  to  name  a  few.  Unfortunately,  Levit  includes  several 
improper  assumptions  causing  his  results,  as  well  as  those  based 
on  his  results,  to  be  questionable.  These  errors  are  noted  by 
Lin  et  al.  (1994),  who  have  taken  a  fundamental  approach  in 
examining  the  kinematics  of  the  BSM.  Basically,  Levit  does 
not  account  for  the  torsion  of  the  helical  path  of  the  individual 
ball  center. 

In  this  paper,  we  will  first  examine  the  efficiency  of  the  BSM 
using  a  quasi-static  approach  much  like  Levit’s,  but  with  the 
proper  kinematics.  Since  dynamic  forces  are  neglected  by  the 
quasi-static  analysis,  a  more  exact  approach  will  be  taken.  In 
this  approach,  we  will  consider  the  steady-state  motion  of  the 
ball  for  a  three-point-contact  profile  (Gothic  profile)  without 
deformation  by  numerically  solving  the  Newton-Euler  equa¬ 
tions  and  the  relevant  kinematic  equations  simultaneously. 
While  the  efficiency  of  the  BSM  actually  varies  as  a  function 
of  time,  the  steady-state  simplification  is  valid  for  many  ap¬ 
plications  and  additionally,  it  can  be  more  easily  applied  for 
developing  design  methods.  Closed-form  solutions  are  even 
more  easily  applied  to  the  development  of  design  methods  and 
such  solutions  are  derived  in  this  paper.  These  closed-form 
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solutions,  which  account  for  the  dynamic  forces,  are  shown 
to  be  valid  through  close  agreement  with  the  results  of  the 
steady-state  theory.  A  detailed  design  methodology  for  the 
BSM  based  on  the  closed-form  theory  is  then  presented. 

In  general,  the  analysis  of  the  ball  screw  motion  can  be 
divided  into  two  different  categories  according  to  the  driving 
component;  i.e.,  nut  driving  or  screw  driving.  Additionally, 
each  of  these  categories  can  be  further  divided  according  to 
the  type  of  input  motion;  i.e.,  conversion  of  rotary  into  linear 
motion  or  conversion  of  linear  into  rotary  motion.  Since  the 
procedures  for  analyzing  the  different  types  of  motion  are 
similar  and  all  types  of  motion  are  kinematic  inversions  of 
each  other,  in  this  paper,  we  will  only  examine  the  cases  with 
the  screw  as  the  driving  component. 

Quasi-Static  Efficiency  Analysis 

Levit,  in  his  classic  work  on  the  BSM,  assumes  that  there 
is  no-slip  between  the  balls  and  the  nut  and  screw.  Thus,  he 
derives  the  quasi-static  ball  screw  efficiency  by  assuming  that 
there  is  only  rolling  resistance  at  the  ball/screw  and  ball/nut 
contact  points.  Levit’s  results  imply  that  the  upper  bound  on 
ball  screw  efficiency  should  be  unity.  However,  recent  detailed 
examination  of  the  kinematics  of  the  BSM  proves  that  the  no¬ 
slip  condition  is  not  attainable  and  that  there  must  be  slip  on 
the  normal  plane  of  at  least  one  contact  point;  see  Lin  et  al. 
(1994).  Therefore,  in  this  work,  we  have  used  the  correct  kin¬ 
ematics  and  have  assumed  both  slip  on  the  normal  plane  and 
rolling  resistance  in  the  tangent  direction  at  the  contact  points 
in  order  to  determine  the  quasi-static  efficiency  of  the  BSM- 
Additionally,  we  assume  that  there  are  two  contact  points 
between  an  individual  ball  and  the  screw  and  nut. 

For  the  case  of  screw  driving,  suppose  that  a  moment,  Mk, 
is  acting  on  the  screw  in  order  to  overcome  an  axial  load 
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applied  on  the  nut.  By  considering  static  equilibrium  of  the 
screw,  the  ball  and  the  nut,  respectively,  we  arrive  at  the  fol¬ 
lowing  relations: 

+/(a  -  Q)  ]  +  ) , 


=  =  and 

(1) 

where 

M  =  the  torque  applied  on  the  screw  or  the  nut, 

Fg  =  the  axial  force  applied  on  the  screw  or  the  nut, 

Qi  =  the  normal  force  at  the  contact  points  between  the 
ball  and  nut  and  screw,  respectively, 
and  the  corresponding  subscripts: 

A  and  B  denote  the  points  between  the  ball  and  the 
nut  and  between  the  ball  and  the  screw,  respectively, 
/  =  the  Coulomb  coefficient  of  friction  at  the  contact 
points, 

rm  =  mean  radius  of  the  helical  path  of  the  ball  centerline, 
ri,  =  ball  radius, 
a  =  /-ft/rm, 

a  =  helix  angle  of  the  path  of  the  ball  centerline, 

Q»  5/  =  cosine  and  sine  functions,  respectively,  with  the  angle 
denoted  by  the  subscript,  and  we  will  later  use 
ti  =  tangent  function  with  the  angle  denoted  by  the  sub¬ 
script. 

The  friction  angle,  p,  is  used  to  represent  the  dissipated  energy 
due  to  rolling  friction  between  the  contact  surfaces  and  can 
be  represented  by  the  following  expression,  which  was  derived 
by  Levit: 

where/,  denotes  the  rolling  coefficient  of  friction.  We  addi¬ 
tionally  assume  that  the  contact  angles  between  the  ball  and 
the  screw  and  nut,  and  respectively,  are  equal;  i.e., 

-  &B  — 

The  efficiency  is  equal  to  the  ratio  of  the  work  done  by  the 
output  forces  to  the  work  done  by  the  input  and  can  be  rep¬ 
resented  as 


Sfi  +f{a—  )  -I-  tp/ ta 


(3) 


Using  a  similar  approach  for  the  case  of  nut  driving,  the  ef¬ 
ficiency  can  be  simply  expressed  as 


/(g+2Cg)-hV(Ce^^) 

Sfi  +/(g  +  Cff)  4“  tp/ ta 


Fig.  1  The  position  of  the  ball  center,  O in  Cartesian  coordinates  and 
Frenet-Serret  coordinates 


ciency,  in  certain  instances,  the  designer  may  elect  to  integrate 
the  general  equations  of  motion  in  order  to  monitor  the  time 
varying  efficiency. 

To  develop  the  equations  of  motion,  we  first  consider  the 
position  of  the  ball  center  and  we  employ  the  following  three 
coordinate  frames:  a  fixed  Cartesian  frame,  ox'y'z\  a  Carte¬ 
sian  frame  rotating  with  the  screw,  oxyz,  and  a  Frenet-Serret 
coordinate  frame  along  the  ball  trajectory,  o' nbt,  as  shown 
in  Fig.  1.  We  then  arrive  at  the  Newton-Euler  equations  which 
represent  the  ball  motion.  The  steady-state  form  of  these  equa¬ 
tions  of  motion,  for  the  case  of  converting  rotary  into  linear 
motion,  follow. 

+  045^4  +  04  ' )  =  0  {5a) 

Qa  ( Os  Qa  (  Oj 

+  Gx ' ( Q  +fS0C^A ')  =  mr„{e +  6)^  (Sb) 
Qa{Sb  +/CpC^^) ~ Qb{S0 -/CbC^b)  -Qa'(S0 -/CbC^a 0  =  0 

(5c) 


A  More  Exact  Efficiency  Analysis 

The  quasi-static  analysis  of  the  preceding  section  provides 
an  approximation  to  the  ball  screw’s  efficiency  since  dynamic 
forces,  which  will  have  an  effect,  have  been  neglected.  In 
actuality,  ball  screw  efficiency  will  vary  as  a  function  of  time 
due  to  the  system  dynamics  and  the  time-dependent  nature  of 
the  input  torque.  In  order  to  design  with  utmost  care,  one 
would  need  to  solve  the  general  equations  of  motion  in  order 
to  ^culate  the  time  varying  efficiency,  and  corresponding 
design  methods  could  be  developed  that  consider  either  a  max¬ 
imum  or  a  mean  value  of  efficiency.  However,  such  an  ap¬ 
proach  would  be  quite  tedious  and  results  may  be  difficult  to 
use  for  design  purposes.  By  considering  steady-state  ball  screw 
motion,  an  approximation  of  the  equations  of  motion  is  ob¬ 
tained  that  still  accounts  for  some  of  the  dynamic  forces.  This 
approximation  provides  a  basis  for  developing  a  simplified 
design  method  and  it  is  valid  for  many  ball  screw  applications. 
Since  the  steady-state  theory  results  in  an  approximate  effi¬ 


frt>(QBC^B-QACrt.A {5d) 
ff'bSffiQBSjf^B’^  QaS^a  +  Qa  ' )  =/(  ^  +  li)  ~ 

(5^) 

-frtCBiQBS^B-QAS^A  -  Qa 'S^A')  =  ri  B  +  Q)oA  (5/) 

where: 

=  the  angle  between  the  direction  of  the  friction 
force  and  the  normal  plane  at  the  contact  points, 
and  the  corresponding  subscripts: 

A  and  A '  denote  the  major  and  minor  contact 
point  between  the  ball  and  the  nut,  respectively, 
and  B  denotes  the  major  contact  point  between 
the  ball  and  the  screw, 
m  =  mass  of  an  individual  ball, 

I  ~  mass  moment  of  inertia  of  the  ball  relative  to 
its  mass  center, 

6  =  the  angular  velocity  of  the  ball  relative  to  the 
screw  along  the  helical  path. 
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Q  =  the  angular  velocity  of  the  screw,  and 

angular  velocity  of  the  ball  with  respect  to 
its  center  of  mass  in  the  normal,  tangential  and 
binormal  directions  of  the  helical  path. 

Note  that  these  equations  assume  a  three-point  contact  profile 
between  the  ball  and  the  nut  and  screw.  With  'the  three-point 
contact,  there  are  two  major  contact  points,  A  and  B,  through 
which  the  majority  of  the  load  is  transferred.  Additionally,  a 
minor  contact  point,  A ' ,  must  exist  in  order  to  equilibrate  the 
centripetal  force  and  adequately  constrain  the  ball.  We  ad¬ 
ditionally  assume  that  the  contact  angles  between  the  ball  and 
the  screw  and  nut,  and  respectively,  in  this  case, 

are  equal;  i.e.,/3^  ^  &  a'  =  0a  ==  j8.  From  a  design  perspective, 
equal  contact  angles  result  in  consistent  loading  over  all  of  the 
contact  points  helping  prevent  local  failure  and  premature 
fatigue.  Theoretically,  the  contact  angle  can  be  between  0  deg. 
and  90  deg.  In  actuality,  the  contact  angle  is  limited  by  ge¬ 
ometry  and  manufacturing  techniques  and  is  most  often  be¬ 
tween  45  deg.  and  60  deg.  The  interested  reader  is  again  referred 
to  Lin  et  al.  (1994)  for  a  more  detailed  derivation  of  these 
equations,  as  well  as  those  to  follow. 

The  slip  angles,  the  angles  between  the  friction  force  direc¬ 
tions  and  the  normal  plane  at  the  contact  points,  are  better 
defined  as 


where  and  Vyi  denote  the  magnitude  of  the  velocity  in  the 
normal  plane  and  the  tangential  direction,  respectively,  mth 
respect  to  the  local  coordinate  systems  at  the  contact  points; 
see  Fig.  2.  Through  a  study  of  the  kinematics  (see  Lin  et  al., 
1994),  the  following  slip  velocities  result: 

For  the  ball/nut  contact  point: 

VxA  =  **' ^yA  =  cl{6  Q)  +  r ^ (cj^C^ *“ conS/j)  (7) 

and  for  the  ball/screw  contact  point: 

Fxff=r*(ci),-dSa)  and 

VyB  =  de-rt,[{o)b-6c^)C^-o3„S^]  (8) 

where  d  =  r„/Ca- 

By  imposing  static  equilibrium  on  the  screw  and  nut,  we 
have  the  following  expressions  which  can  be  used  to  relate  Uie 
torque  applied  to  the  screw  (or  nut)  to  the  axial  load  acting 
on  the  nut  (or  screw): 

A/=  (TR/xFf-hlt/'XF/O^k 

=  rm{^t{Qi'  -  Qi)SotS0  +/[  -  Set  ( QfC^i  +  Qi '  Cyf,i'){wia  +  Q) 

+  C,(Q/S^/+  Qi'S^r){l  +  w/aQ)] )  and 

(F/+F/0*k 

=  W/(<2/-  Qi')CaSB  +yiCaC/5(2i^^/+  Qi'Q/') 

+  i=A,B.  (9) 

where 

'R/  =  the  position  vector  of  contact  point  /  with  respect  to 
the  rotational  Cartesian  coordinate  system  (see  Fig.  1), 
F/  =  the  force  vector  applied  on  the  ball  at  contact  point  /, 
and 

the  weighting  function,  w/,  allows  distinction  between  the  var¬ 
ious  contact  points  and  is  defined  as 

w/=  1  for  i  =  A  (A  denotes  the  ball/nut  contact) 

=  - 1  fox  i=B  (B  denotes  the  ball/screw  contact). 

Equation  (9)  provides  two  simultaneous  equations  for  either 
the  screw  driving  or  the  nut  driving  case.  When  a  torque  is 
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Fig.  2  Coordinate  systems  at  the  contact  points 


applied  to  the  nut,  the  subscript  /  =  A  will  apply  for  the 
moment  equation  and  the  subscript  /  =  B  vvill  apply  for  the 
force  equation  since  an  axial  force  will  be  acting  on  the  screw. 
When  the  torque  is  applied  to  the  screw,  /  =  B  applies  for  the 
moment  equation  and  /  =  A  applies  for  the  force  equation. 

Equations  (5)-(9)  have  been  solved  numerically  for  the  screw 
driving  case  with  the  following  ball  screw  parameter  values: 
rt  =  4.37  mm  (0.172  in.),  =  24.3  mm  (0.956  in.).  Q  = 

2000  rpm,  and  a  =  10  deg.  for  varying  contact  angles  or  ^ 

=  45  deg.  for  varying  helix  angles.  The  coefficient  of  friction, 
/,  is  assumed  equal  to  0.075,  a  value  representative  of  the 
contact  conditions  normally  found  in  the  BSM  and  this  value 
will  be  used  for  the  remainder  of  the  paper.  Additionally,  it 
is  assumed  that  an  axial  load,  F^,  equal  to  21 13  Nt  (475  lb)  is 
resisting  motion  of  the  nut.  Figures  3  and  4  depict  the  results. 
We  show  the  relationships  between  efficiency  and  contact  angle 
and  helix  angle  in  Fig.  3.  Figure  4  displays  the  normal  loads 
as  a  function  of  contact  angle.  In  this  figure,  the  curve  rep¬ 
resents  the  load  at  the  major  contact  points,  A  and  S,  which, 
for  all  practical  purposes,  are  equal,  and  the  load  at  the  minor 
contact  point,  A  \  which  is  orders  of  magnitude  smaller,  is 
not  distinguishable  from  the  abscissa. 

The  Approximate  Closed-Form  Solution 

The  theory  developed  above  involves  the  numerical  solution 
of  the  set  of  equations  representing  the  steady-state  motion  of 
the  ball  screw  and  designing  ball  screw  mechanisms  using  this 
theory  is  obviously  quite  cumbersome.  The  current  section 
derives  an  approximate  closed-form  solution  for  the  ball  sctcw 
motion  and  considerable  insight  is  gained  into  the  sensitive 
design  parameters.  This  solution  also  allows  us  to  investigate 
the  optimal  design  of  the  BSM. 

For  the  mechanism  operating  at  relatively  low  speed,  which 
is  true  for  most  BSM  currently  used  in  industry,  it  is  reasonable 
to  assume  that  the  centripetal  force  is  very  small  compared 
with  the  normal  loads.  Additionally,  from  the  numerical  results 
above,  we  recognize  that  the  normal  load  at  the  minor  contact 
point  is  very  small  compared  to  the  normal  loads  at  the  two 
major  contact  points.  The  third  contact  point  is  necessary  to 
provide  a  stable  system,  but  has  only  little  effect  on  the  re¬ 
sultant  motion  at  these  relatively  low  speeds.  Thus,  Qa  ~  Qb* 
indicating  that  a  two-point-contact  model  is  a  reasonable  ap¬ 
proximation.  For  the  particular  size  of  balls  used  in  this  model, 
we  find  that  this  approximation  is  valid  for  screw  speeds  as 
high  as  2, (XX)  rpm. 

With  the  assumption  that  Qa'  =0,  from  Eq.  (5t2),  we  obtain 
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Helix  Angle  or  Contact  Angle  (degrees) 

Fig.  3  Efficiency  as  a  function  of  helix  angle  or  contact  angler-con¬ 
version  of  rotary  into  linear  motion 


Slip  Angle.  Vg  (degrees) 

Fig.  5  Efficiency  as  a  function  of  the  slip  angle  between  the  ball  and 
the  screw 


Fig.  4 
rotary 


efficiency  point  of  view,  than  that  on  the  normal  plane.  Thus, 
the  upper  and  lower  limits  of  efficiency  are  at  =  0  deg. 
(normal  plane)  and  =  90  deg.  (tangential  direction),  re¬ 
spectively.  We  note  that  we  could  arrive  at  the  same  conclusions 
by  using  the  slip  angle  at  the  screw/ball  contact  point, 
since  ~  ^ 

Furthermore,  the  following  equation  is  obtained  by  dividing 
Eq.  (5/)  by  Eq.  (5t/): 

L 


C0t(^^)  +  C0t(^a)  =  ^ 


(12) 


By  combining  Eqs,  (10)  and  (12),  we  have 

^B  =  V'^~7r  =  tan' 


-fe)' 


Contact  Angle  (degrees) 

Normal  loads  as  a  function  of  contact  angle*-conversion  of 
into  linear  motion,  screw  driving 


Therefore,  Eq.  (11)  can  be  further  simplified  to 

/ 


(13) 


(14) 


Qb 


(10) 


which  implies  that  «  x  4-  ips-  Note  that  this  result  also 
agrees  well  with  the  numerical  results. 

We  note  that  the  nut  moves  a  distance  equal  to  -  fiL/2x  = 
-  Qr„ta  (where  L  denotes  the  lead  of  the  screw)  as  the  screw 
rotates  through  an  angle  Q.  Thus,  from  the  above  assumptions 
and  Eq.  (9),  the  efficiency  of  the  BSM,  tj,  can  now  be  written 
in  closed-form  as 


/(  4) _  Qjv 

5^+yiQa(ff“Q)+5^a(l-^Q)//«]‘  ^  ^ 

It  is  now  simple  to  observe  the  effects  of  the  various  parameters 
on  efficiency. 

Figure  5  shows  the  variation  in  efficiency  with  respect  to  the 
slip  angle  at  the  saew/ball  contact  point,  In  this  figure, 
the  following  parameter  values  are  used:  a  =  0.1  and  a  =  10 
deg.  The  curve  indicates  that  the  friction  along  the  tangential 
direction  dissipates  more  energy  than  that  on  the  normal  plane. 
In  other  words,  the  component  of  frictional  force  along  the 
tangential  direction  plays  a  more  important  role,  from  the 


and  we  note  that  the  only  parameters  affecting  efficiency  from 
this  equation  are  the  coefficient  of  friction,  the  helix  angle  and 
the  contact  angle. 

Using  Eq.  (14),  we  find  that  the  efficiencies  generated  are 
nearly  identical  to  the  numerical  results  obtained  from  the 
steady-state  solution  of  the  Newton-Euler  equations  shown  in 
Fig.  3  where  the  maximum  difference  between  the  theories  for 
the  efficiency  versus  helix  angle  relationship  is  0.33  percent 
and  between  the  theories  for  the  efficiency  versus  contact  angle 
relationship  is  0.86  percent.  Thus,  Eq.  (14)  is  a  valid  repre¬ 
sentation  of  the  efficiency  of  the  BSM  for  this  type  of  motion. 
Further  investigation  of  Fig.  3  reveals  that  an  optimum  contact 
angle  exists  for  peak  efficiency.  This  existence  of  an  optimum 
can  be  explained  as  follows.  If  one  considers  only  the  friction 
on  the  normal  plane,  the  efficiency  is  proportional  to  the  con¬ 
tact  angle,  which  can  be  observed  through  Eq.  (1 1).  However, 
from  Fig.  5,  the  component  of  frictional  force  along  the  tan¬ 
gential  direction,  which  dissipates  more  energy  than  that  on 
the  normal  plane,  becomes  more  important  at  higher  contact 
angles.  Hence,  a  peak  efficiency  exists  at  a  contact  angle  at 
which  the  two  frictional  components  reach  some  particular 
ratio.  To  obtain  the  optimum  contact  angle  mathematically, 
we  differentiate  the  denominator  of  Eq.  (14)  with  respect  to 
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Fig.  6  Relationship  between  contact  angle  and  helix  angle  to  achieve 
optimal  efficiency 


Helix  Angle,  a  (degrees) 

Fig.  7(a)  The  self-braking  design  space— conversion  of  rotary  into  lin¬ 
ear  motion 


the  contact  angle  and  set  the  derivative  equal  to  zero,  resulting 
in  the  following  expression. 

C|  -f  -f  2fSgi\]  —  Sff  (15) 

One  notes,  from  this  equation,  that  the  optimum  contact  angle 
can  be  expressed  as  a  function  of  helix  angle.  The  resultant 
relationship  is  depicted  in  Fig.  6.  This  curve  indicates  that  the 
peak  efficiency  exists  only  at  lower  helix  angles  and  higher 
contact  angles.  The  optimum  efficiency  is  arrived  at  by  merely 
solving  Eqs.  (14)  and  (15)  simultaneously.  The  resulting  op¬ 
timal  efficiency  value  is  86.1  percent  for  all  helix  angles  and 
the  corresponding  contact  angle  from  Eq.  (15),  and  the  coef¬ 
ficient  of  friction  equal  to  0.075. 

Self-braking  is  that  condition  in  which  it  is  kinematically 
impossible  for  the  mechanism  to  perform  useful  work  and  is, 
of  course,  highly  undesirable.  The  self-braking  condition  is 
determined  by  combining  Eqs.  (11)  and  (13)  and  results  when 
the  numerator  of  Eq.  (11)  becomes  negative;  i.e.. 


Sfi  ^f\l  -F  (16) 

The  shaded  region  below  the  curve  in  Fig.  7a  represents  the 
design  space  in  which  self-braking  occurs.  We  note  that  self¬ 
braking  takes  place  for  designs  with  low  contact  angles  for  the 
helix  angles  normally  used  and  additionally,  that  the  self-brak¬ 
ing  condition  always  takes  place  at  the  driven  side  of  the  mech¬ 
anism. 


Conversion  of  Linear  into  Rotary  Motion.  Employing  a 
similar  procedure  as  above,  we  obtain  the  efficiency  for  the 
case  of  converting  linear  into  rotary  motion  as 


7,=  !- 


/ 


+/( 


(17) 


It  is  again  easy  to  develop  relationships  between  efficiency  and 
contact  angle  and  helix  angle.  For  this  case,  the  trends  are  the 
same  as  the  case  of  converting  rotary  into  linear  motion,  but 
with  slightly  different  values.  Now,  the  peak  efficiency  occurs 
when  the  following  expression  holds: 

-F  -H  (18) 

and  the  resulting  relationship  between  contact  angle  and  helix 
angle  is  additionally  shown  in  Fig.  6.  This  curve  also  indicates 
that  the  peak  efficiency  exists  only  at  lower  helix  angles  and 


Helix  Angle,  a  (degrees) 

Fig.  7(6)  The  sell-braking  design  space— conversion  of  linear  into  ro¬ 
tary  motion 


higher  contact  angles.  The  optimum  efficiency  is  arrived  at  by 
merely  solving  Eqs.  (17)  and  (18)  simultaneously.  As  in  the 
case  of  conversion  of  rotary  to  linear  motion,  the  optimal 
efficiency  value  is  86.1  percent  for  all  helix  angles  and  the 
corresponding  contact  angle  from  Eq.  (18),  and  the  coefficient 
of  friction  equal  to  0.075.  In  this  case,  the  self-braking  con¬ 
dition  occurs  when 

/S^>\/c|+'d.  (19) 

The  shaded  region  above  the  curve  in  Fig.  1(b)  depicts  the 
design  space  for  self-braking  and  this  takes  place  only  for 
designs  with  low  helix  angles  and  high  contact  angles. 


Load  Capacity  Considerations 
The  normal  load  transmitted  through  the  contact  surfaces 
should  be  designed  to  be  as  low  as  possible  for  a  given  loading 
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condition  to  attain  the  highest  possible  load  capacity.  For  the 
case  of  conversion  of  rotary  into  linear  motion,  we  consider 
a  constant  load  Fg.  Then,  by  neglecting  the  friction  force  terms 
which  result  in  surface  tractions  as  opposed  to  normal  loads, 
from  Eq.  (9),  we  notice  that  causing  to  be  as  high  as 
possible  results  in  the  lowest  normal  loads.  This  implies  that 
designs  with  small  helix  angles  and  large  contact  angles  give 
the  BSM  the  maximum  load  capacity.  With  the  ball-screw 
designed  such  that  it  operates  at  its  maximum  efficiency,  we 
find  that  the  term  is  constant  for  all  helix  angles  and  the 
corresponding  contact  angle  from  Eq.  (15),  and  thus,  the  load 
carrying  capacity  cannot  be  improved  for  this  optimum  case. 

For  the  case  of  conversion  of  linear  into  rotary  motion, 
assuming  constant  load  Af,  Eq.  (9)  reveals  that  the  lowest 
normal  loads  occur  for  a  high  term.  This  indicates  that 

with  large  mean  radius,  helix  angle  and  contact  angle,  the  BSM 
has  the  highest  load  capacity.  Additionally,  we  note  that  the 
parameter  a  (ratio  of  ball  radius  to  mean  radius)  is  not  a  critical 
parameter  within  the  range  found  in  most  industry  applica¬ 
tions;  i.e.,  usually,  0.07  <  a  <  0.2.  Thus,  for  this  case,  one 
would  like  to  choose  the  highest  contact  angle  that  is  possible 
given  manufacturing  constraints  and  select  the  corresponding 
helix  angle  from  Eq.  (18)  for  optimum  efficiency  in  order  to 
design  the  most  efficient,  highest  load  carrying  ball  screw 
mechanism. 


Design  Procedure 


Based  on  the  results  arrived  at  in  the  preceding  analysis,  the 
following  methodology  is  recommended  for  the  design  of  the 
BSM: 


(/)  Select  a  preliminary  helix  angle  based  on  the  discussion 
related  to  load  capacity  and  the  corresponding  motion. 

(2)  Determine  the  contact  angle  for  optimal  efficiency  from 
either  Eq.  (15)  or  (18). 

(J)  Choose  the  screw  length  according  to  the  application 
requirements. 

{4)  Determine  the  mean  radius  such  that  >  screw  length/ 
60,  as  noted  by  Levit  (1963b). 

(5)  Select  the  ball  size  such  that  0.07  <  a  <  0.2. 

(6)  Determine  the  thread  profiles  such  that  0.95  <  r^/r^ 
<  0.97,  where  ri  =  the  radius  of  curvature  of  the  thread 
profile  for  the  nut  or  screw. 

(7)  Determine  the  approximate  contact  force  from  the 
specified  resistive  force  or  moment  and  the  equations  shown 
below,  which  were  derived  from  Eq.  (9)  with  friction  neglected. 

For  the  conversion  of  rotary  into  linear  motion: 


For  the  conversion  of  linear  into  rotary  motion: 


(20) 


2/  = 


M 


(21) 


(8)  Determine  the  relative  motion  between  the  screw  and 
the  nut  from  the  following  relationship: 


advance  distance  =  (r^/J  (angular  displacement). 

(P)  Calculate  the  maximum  contact  stress  using  the  ap¬ 
proximate  contact  load,  the  ball  radius,  the  thread  radius  of 
curvature  and  Hertz’  theory. 


(10)  Determine  the  efficiency  of  the  BSM  from  Eos 
or  (17).  ^ 

This  procedure  should  be  an  iterative  process  with  a  new  set 
of  ball  screw  dimensions  selected  if  any  of  the  criteria  are 
violated.  For  instance,  if  the  contact  stresses  determined  from 
the  Hertzian  theory  are  too  high,  most  likely  larger  dimensions 
are  necessary.  Similarly,  if  the  relative  motion  is  not  appro¬ 
priate  (size  constraints  are  violated),  a  new  helix  angle,  as  well 
as  other  dimensions,  may  be  necessary.  Additionally,  this  pro¬ 
cedure  has  not  discussed  the  speed  requirements  nor  the  avail¬ 
able  motor  power  of  the  application.  Obviously,  these  factors 
will  also  affect  the  final  design. 

Conclusions 

This  paper  provides  a  powerful  set  of  tools  for  analyzing 
and  designing  the  ball  screw  mechanism.  The  exact  steady- 
state  motion  within  the  ball  screw  mechanism  has  been  deter¬ 
mined  by  numerically  solving  the  Newton-Euler  equations  of 
motion  and  the  kinematic  equations  simultaneously.  This  pa¬ 
per  has  developed  a  simplified  closed-form  solution  for  ball 
screw  motion  and  has  employed  this  theory  in  considering  the 
optimum  design  of  this  mechanism.  In  terms  of  efficiency,  the 
driving  component  does  not  have  an  effect.  That  is,  only  the 
type  of  motion  conversion  is  important.  Thus,  the  necessary 
relationship  between  contact  angle  and  helix  angle  for  optimum 
efficiency  has  been  formulated  for  all  types  of  ball  screw  mo¬ 
tion.  Additionally,  the  conditions  for  self-braking  have  been 
discussed  as  well  as  the  load  capacity  of  the  BSM.  The  results 
of  the  closed-form  solution  have  been  presented  in  a  dimen¬ 
sionless  manner  allowing  this  work  to  be  applied  to  the  design 
of  any  ball  screw  mechanism.  Finally,  a  complete  design  pro¬ 
cedure  for  the  BSM  has  been  developed. 
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AbSSaa  In  this  paper,  a  ccHuplctc  solution  of  the  steady-state  Newton-Eulcr  equations  describing  the  motiOT 
of  the  balls  in  high  speed  thrust  ball  bearings  is  presented.  Kinematic  equations  are  derived  and  used  as 
anciUary  equations  to  obtain  the  numerical  solution  of  the  Ncwton-Euler  equations.  This  differs  from 
previously  reponed  work  where  extraneous  assumptions  (instead  of  kinematic  analysis)  are  used  to  allow 
^lutions  of  these  equaticHis.  It  is  shown  that  the  nosiip  condition  for  the  ball  motion  repo^  in  the  literature 
is  bas^  on  an  unnecessary  assumption.  The  correct  form  of  the  condition  is  dcvelopwi  and  used  in  the 
analysis  of  the  ball  motion.  The  analysis  shows  the  importance  of  contaa  angle  on  both  ball  motion  and 
contaa  force.  An  approximate  theory  is  then  developed  for  determining  the  optimum  contact  angle  that  would 
result  in  minimum  contact  force  in  high  speed  ball  bearings.  The  results  arc  conq)ared  with  experimental  data, 
and  theory  is  illustrated  by  an  example. 

Keywords.  Thrust  ball  bearings;  high  speed;  kinematics;  design. 


INTRODUCTION 

The  ball  motion  within  ball  bearings  was  first  investigated 
by  Jones  (1959)  where  he  derived  equations  governing  the 
steady-state  motion  of  the  ball.  Jones  did  not  solve  the 
equations  that  he  derived  and  only  used  them  to  investigate 
frictional  effects  in  ball  bearings.  Subsequently,  Harris  (1971) 
made  several  simplifying  assumptions  that  made  it  possible  to 
solve  Jones'  equations  obtaining  numerical  results  for  the 
motion  of  the  ball.  Harris'  technique,  also  presented  in  Harris 
(1984),  includes  an  assumption  that  there  is  a  sufficiently  large 
static  coefficient  of  friction  between  the  ball  and  the  raceway  to 
prevent  ball  rotation  due  to  the  gyroscopic  moment. 
Additionally,  he  assumed  that  there  is  no-slip  at  the  contact 
points  in  certain  directions.  Later,  Gupta  (1975,  1979) 
detomined  the  complete  solution  to  the  differentia  equations  of 
motion  of  the  ball  by  including  contact  deformations.  He  also 
considered  full  elastohydrodynamic  lubrication. 

In  tltis  paper,  we  show  that  neither  the  so  called  race  conm^ 
assumption  of  Jones  and  Harris  nor  the  inclusion  of  the  contact 
deformation  arc  necessary  for  obtaining  solutions  to  the 
equations  representing  ball  motion  in  hi^  speed  thrust  bail 
bearings.  For  such  devices,  one  can  assume  three  points  of 
contact  between  the  ball  and  the  raceways  and  solve  the 
equations  of  motion.  This  is  accomplished  through  the  use  of 
kineimtic  analysis  to  develop  ancill^  equations,  which  when 
combined  with  the  steady-state  equations,  allow  for  numerical 
solutions  describing  the  motion  of  the  ball.  In  performing  the 
kinematic  analysis,  we  develop  theoretically  correct  no-siip 
conditions  and  show  that  they  differ  from  tiiose  in  the  literature; 
see,  e.g.,  Palmgren  (1945),  AUen  (1964),  Houghton  (1976), 
Hamrock  and  Dowson  (1981),  and  Harris  (1984).  We  then 
apply  the  results  and  develop  an  approximate,  but  closed-form, 
theory  suitable  for  design  purposes.  This  approximate  theory  is 
used  to  develop  a  design  strategy  for  (^timum  contact  angle. 

KINEMATICS  OF  THRUST  BALL  BEARINGS 
Classical  ball  bearing  kinematics  assumes  tiiat  the  velocity  of  the 
ball  center  can  be  obtained  as  the  mean  of  the  inner  and  outer 
raceway  velocities,  and  additionally,  the  relative  angular  velocity 

between  the  inner  raceway  and  the  ball  is  defined  as  (cOj^-cd.). 
Accordingly,  previous  researchers  have  developed  the  following 
relationship  between  angular  velocities  for  the  no-slip  conditicxi: 

=  (1) 

where 


-  mean  radius  of  the  path  of  the  ball  center, 
r^  -  ball  radius,  and 

=  the  angular  velocity  of  the  ball  with  respect  to  its 
center. 

However,  the  arms  of  rotation  of  the  ball  and  the  raceways 
are  dififerent  such  that  the  angular  velocities  of  the  various 
elements  of  the  bearing  are  not  additive.  Hence,  the  correct 
equation  for  angular  velocities  with  the  no-slip  condition  is 
acuially 

rm“ni-'b®b  =  (vrb)o>i  (2) 

where,  here,  we  define 

gOjq  =  die  orbital  angular  velocity,  and 

0)^  =  the  angular  velocity  of  the  inner  raceway  with 

respect  to  the  ground. 

By  comparing  the  above  two  equations,  we  recognize  that  the 
arm  of  rotation  of  the  ball  center  from  Eqn.  (2)  is  equal  to  r^  as 
opposed  to  (rjj^-TfP  indicated  by  Eqn.  (1).  Depen^g  on  the 
relative  size  of  the  balls  and  the  raceway,  the  error  associated 
with  use  of  Eqn.  (1)  could  be  considerable. 
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Fig.  1.  The  coordinate  systems  employed  for 
the  kinematics  of  thrust  ball  bearings. 

We  now  study  the  kinematics  of  the  thrust  ball  bearing  for 
the  case  in  which  the  outer  raceway  is  fixed  and  the  inner 
raceway  is  routing.  The  case  with  the  inner  raceway  fixed  is 
merely  a  kinematic  inversion  of  this  case.  We  employ  the 
following  three  coordinate  frames:  a  fixed  Cartesian  f^e, 
oxjyjZj,  a  Canesian  frame  routing  with  the  inner  raceway. 


0x2X222*  and  a  ball  centered  coordinate  frame  moving  along  the 
ball  trajectory,  o’XjXjZj,  as  shown  in  Fig.  1.  Fig.  2  shows  the 
local  coordinate  system,  at  the  contact  point  between 

the  ball  and  the  inner  raceway  which  will  be  used  in  developing 
the  slip  velocities  at  this  point 


In  a  similar  manner,  the  slip  velocity  at  the  ball/outer  raceway 
contact  point  A,  can  be  represented  as 


With  the  assumption  of  rigid  body  motion,  we  arrive  at  the 
following  velocity  of  the  contact  point  on  the  ball: 


T 


X4  (3) 


CO  =  K  0^]  X3.  X3=  [i3  j3  k3]^.  X  =  [i^  j,  kj^. 

Ro»  =  the  absolute  velocity  of  the  origin  of  the  ball 

centered  coordinate  system, 

6  =  the  angular  velocity  of  the  ball  along  its  circular 

path  relative  to  the  inner-raceway, 

=  the  angular  velocity  the  inner-raceway, 

CDjj,  Q)y,  =  the  angular  velocity  of  the  bail  with  respect  to  its 
center  of  mass  in  the  X3,  y3,  Z3  directions, 
respectively,  of  the  ball  centered  coordinate 
system, 

Cj,  Sj  =  cosine  and  sine  functions,  respectively,  with  the 
angle  denoted  by  the  subscript,  and 

0  =  the  contact  angle  at  the  hall/inner  raceway  contact 

point 

Likewise,  the  velocity  of  the  contact  point  on  the  inner 

raceway  can  be  exi^essed  as 


r„(e  +  ii).r^[co,Sp+(a,Cp] 
0 


''bi- 

where 


0 

0 


(4) 


*2“  ['2  h  J  . 

and  Rg  denoces  the  positioa  of  the  contact  point  with  respect  to 
the  fixed  coordinate  syst^ 

Thus,  the  slip  velocity  at  contact  point  B  between  the  ball 
and  the  inner  raceway  can  be  expressed  as 


V  =V  -V  =fv  V 
SB  Bb  Bi  L’xB 


ya 


rm“y 


X,. 


(5) 


-rb“y 


rJ0  +  D)-  r^(o),Sp+<o,Cp) 


*4- 


0 


(6) 


No-slip  conditions  can  now  be  derived  from  these  last  two 
equations.  Ffcra  the  x-direction  slip  velocities: 

0>y  =  0  (7) 


and  from  the  y-diiection  slip  velocities: 

“*  =  o 


a 


l±aCp 

2 


(8) 

(9) 


n 


aCp±l 

2aC 

P 


(10) 


where  (0+Q)/ii  can  be  thought  of  as  the  orbital  velocity  of  the 

ball,  (iiJQ  as  the  ginning  velocity  of  the  ball,  a  =  r^/r^,  the  ratio 
of  the  ball  size  to  the  mean  radius  of  its  path,  and  the  sign 
applies  for  outer-raceway  driving  and  the  sign  applies  for 
inner-raceway  driving. 


DYNAMICS  OF  BALL  MOTION 


By  considering  equilibrium  of  the  individual  ball,  we  write  the 
steady-state  Newton-Euler  equations  governing  baU  motion  in 
thrust  ball  bearings  as  follows: 


-  0  (11a) 


=mr„(^Q)2 

(b) 

Qa(Sp + «tC,AKJB(Sp  -  - 

fCpC^A.)  =  0(c) 

frb(QBC,B-QAC,A-QA^A) = 

(d) 

fib®p(QBSyB'QASyA'*^’SvA‘)  * 

(e) 

-frb9p^QB^\(«‘QA^VA‘QA‘^VA’)  “  ® 

(f) 

where: 

Qi  =  the  normal  force  at  the  contact  points  between  the 

ball  and  outer-  and  inner-raceway,  respectively, 
Vi  =  the  angle  between  the  direction  of  the  friction 

force  and  the  normal  plane  at  die  contact  points, 
and  the  corresponding  subscripts: 

A  and  A'  denote  tiie  major  and  minor  contact 
point  between  the  ball  and  the  outer-raceway, 
respectively,  and  B  denotes  the  major  contact 
point  between  the  ball  and  the  inner-raceway, 
f  =  the  Coulomb  coefficient  of  friction  at  the  contact 

points, 

m  s  mass  of  an  individual  ball,  and 

I  =  mass  moment  of  inertia  of  the  ball  relative  to  its 

mass  center. 

Note  that  we  assume  that  there  are  three  points  of  contact 
between  the  ball  and  the  iimer  and  outer  raceways,  and  that  the 

contact  angles  arc  equal  at  these  contact  points;  i.e.,  = 

Pb  =  p.  Additionally,  we  note  that  these  governing  equations 
are  identical  to  those  of  Jones  (1959).  Also,  the  above  equations 

are  valid  for  the  case  of  the  inner  raceway  fixed  by  allowing  Q  to 
represent  the  outer  raceway  angular  velocity. 

The  slip  angles,  the  angles  between  the  friction  force 


0 


directions  and  the  normal  plane  at  the  contact  points,  are  better 
defined  as 


\|r.  =  7t  +  tan' 


yi 


V  . 
V 


(12) 


where  and  Vyj  denote  the  magnitude  of  the  slip  velocity  in 
the  normal  plane  and  the  tangential  direction,  respectively,  with 
respect  to  the  local  coordinates  at  the  contact  points;  see  Fig.  2. 
These  slip  velocities  have  been  derived  in  the  preceding  section 
and  appear  in  Eqns.  (5)  and  (6).  Equation  (12),  which  is  based 
on  the  kinematic  analysis,  dictates  the  directions  of  the  friction 
forces  at  the  contact  points.  These  equations  are  the  ancillary 
conditions  which  allow  the  solution  of  the  Newton-Euler 
equations  without  additonal  simplifying  assumptions.  We  note 
that  the  kinematics  employed  assumes  rigid  body  motion,  but 
that  it  would  be  reasonably  simple  to  extend  this  to  include 
contact  deformation. 

Now,  assume  that  the  thrust  load,  is  applied  by  the 
shaft  By  imposing  static  equilibrium  on  the  inner-raceway,  one 
obtains  the  following  equation: 


(13) 


Steady-state  ball  motion  is  then  determined  by  the  simultaneous 
solution  of  the  ten  nonlinear  algebraic  equations  (11-13)  which 

determines  the  ten  unknowns:  0,  cOy,  (B^,  Q^,  Q^^.,  Qg, 

and  Yg.  We  have  obtained  solutions  of  these  equations 
using  a  modified  Newton's  method;  see  Forsythe,  Malcolm  and 
Moler  (1977).  Results  of  the  numerical  solution  are  shown  in 
Fig.  3  in  which  we  compare  ball  velocities  to  the  experimental 
results  of  Poplawski  and  Mauriello  (1969).  We  employ  the 
following  parameter  values:  r^^  =  4.37  mm  (11/64  in.),  r^^  = 

24.27  mm  (0.9555  in.),  P  =  24.5®,  f  =  0.075,  and  n  =  35000 
rpm. 

Recognizing  that  the  contaa  angle  plays  a  critical  role  in  the 
resultant  characteristics  of  high-speed  thrust  ball-bearings,  we 
employ  the  above  equations  and  show  the  variation  in  ball 
motion  with  respect  to  contact  angle  in  Fig.  4.  We  have 
additionally  shown  the  effect  of  contaa  angle  on  the  noimal  load 
at  the  ball/outer-raceway  contact  point  in  Fig.  5.  In  addition  to 
the  noted  parameter  values.  Figs.  4  and  5  are  based  upon:  F^  = 

1780  N  (400  lb)  and  shaft  speed,  Q  =  27,500  rpm.  Of 
considerable  interest  in  Fig.  5  is  that  a  minimum  of  the  normal 
load  exists  at  some  contact  angle.  Should  the  thrust  bearing 
be  designed  at  such  a  contact  angle,  its  load  capacity  and  life 
could  be  maximized. 
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Fig.  3.  CompaiistHi  with  the  experimental  data  of 
Poplawski  and  Mauriello  (1969). 
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APPROXIMATE  THEORY  FOR  DESIGNING 
HIGH-SPEED  THRUST  BEARINGS 

The  theory  developed  above  involves  the  numerical  solution  of  a 
set  of  ^nations  and  the  design  of  ball  bearings  using  this  theory 
is  obviously  quite  cumbersome.  In  this  section,  we  provide  an 
approximation  to  the  theory  that  allows  a  closed-form  solution 
and  the  development  of  a  simple  methodology  for  determining 
the  optimum  contaa  angle. 

First,  consider  the  no-slip  conditions,  Eqns.  (7)-(l0), 
derived  earlier  in  this  paper.  These  conditions  provide  a  closcd- 


Contact  angle,  p  [degrees] 

Fig.  4.  Orbital  speed  vs  contact  angle. 

form  representation  of  the  bail  motion  and  results  are 
additionally  shown  in  Fig.  3.  Evidently,  with  the  low 
coefficient  of  friction  normally  found  in  ball  bearings,  the  no¬ 
slip  assumption  seems  to  provide  accurate  results  and  thus,  it 
can  be  quite  useful  for  developing  a  design  methodology. 

We  now  recognize  that  the  normal  load  consists  of  two 
major  components:  one  from  the  thrust  force  and  the  other  from 
tile  centripetal  force.  Therefore, 


(14) 


where  the  centripetal  force,  CF  =  mrjj^(6+Q)^.  The  first,  term  of 
the  above  equation  is  obtained  from  equation  (13)  by  neglecting 
the  friction  effect  (which  is  small)  and  the  second  term  is  a  result 
of  the  centripetal  force  which  is  shared  by  the  two  contact  points 
between  the  ball  and  the  outer-raceway  (see  Fig.  6).  As  we  can 
see  from  Eqn.  (14),  the  thrust  load  plays  a  greater  role  at  lower 
contaa  angles,  whereas,  the  centripetal  force  is  more  important 
at  higher  contact  angles.  Thus,  the  optimal  contact  angle 
depends  on  the  ratio  between  thrust  load  and  centripetal  force. 

Since,  the  no-slip  condition  along  the  tangential  direction  is 
a  reasonable  approximation  for  determining  the  motion  in  ball 
bearings,  we  substitute  Eqn.  (9)  into  (14)  and  set  the  first 
derivative  of  with  respect  to  the  contaa  angle  equal  to  zero. 
Thus,  we  arrive  at  this  condition  for  minimizing  the  normal  load: 


(15) 


where  t^  denotes  the  tangent  of  the  contact  angle,  p.  By 

recognizing  that  a^p  «  1,  we  arrive  at  the  following 
expression  for  the  optimum  contaa  angle: 


P  =  tan'^ 


(16) 


Fig.  7  shows  a  plot  of  the  optimum  contact  angle  based  on 
the  above  equation.  Note  that  Eqn.  (16)  is  valid  for  both  inner- 


Fig,  5.  Maximum  contaa  load  as  a  function  of  contact  angle. 
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Fig.  6.  Frcc-body  diagram  of  the  ball. 


and  outer-raceways  driving.  However,  according  to  the  no-slip 
conditions  derived  above,  the  cffbital  speed  is  about  14%  higher 
for  outer-raceway  driving  as  opposed  to  inner-raceway  driving. 
Thus,  the  ccntri^tal  force  is  about  29%  higher  for  the  outer- 
raceway  driving  case.  This  could  be  crucial  when  operating  at 
extremely  high  speeds.  Since  the  no- slip  assumption 
overestimates  the  centripetal  force  compared  to  the  exact 
solunon,  the  desired  contact  angle  should  be  slightly  larger  than 
that  obtained  from  Eqn.  (16). 

The  dimensionless  parameter,  is  representative 

of  the  ratio  between  the  thrust  load  and  the  centripeti  frace.  For 
extreme  cases,  as  the  parameter  approaches  zero,  the  optimal 
contact  angle  should  approach  zero.  As  the  parameter 
approaches  infinity,  the  optimal  contact  angle  approaches  90®. 
Finally,  when  this  parameter  equals  one,  the  optimal  contact 
angle  is  45®. 

EXAMPLE 

As  an  example,  we  use  the  bearing  dimensions  used  eaiiier.  The 
closed-form  solution,  Eqn.  (16),  arrives  at  an  optimum  contact 

angle  p  =  22.1®.  Using  the  exact  theory  represented  by  Eqns. 

(11-13)  and  much  computation,  the  optimum  contact  angle  p  = 
25®  as  shown  in  Fig.  5.  The  approximate  closed-form  solution 
has  underestimated  the  optimum  contact  angle  by  about  10%  for 
the  conditions  examined.  However,  the  difference  in  the 
resultant  contact  force  is  almost  insignificant  Additionally,  one 
notices  from  Fig.  5  that  the  selection  of  a  contact  angle  equal  to 
50®  would  result  in  a  maximum  load  approximately  22%  higher. 
For  an  actual  design,  we  recommend  the  use  of  the  closed-form 
solution  to  arrive  at  an  approximate  contact  angle  and  contact 
load.  If  the  result  were  physically  realizable,  the  designer  would 
then  employ  the  exact  the<^  to  evaluate  additional  details  of  the 
desi^.  As  a  matter  of  interest,  the  Poplawski  and  Mauriello 
bearing's  contact  angle  was  equal  to  24.5®  and  this  bearing 
operated  at  speeds  as  high  as  35,000  rpm. 


Fig.  7.  Optimum  contact  angle  as  a  function  of  dimensionless 
tluust  load/axial  force  ratio. 


CONCLUSIONS 

This  paper  has  provided  a  kinematic  analysis  of  bail  bearings 
and  has  applied  the  results  of  this  analysis  to  provide  the 
necessary  ancillary  equations  to  allow  the  solution  of  the  steady- 
state  Newton-Euler  ^nations  governing  ball  motion  in  thrust 
ball  bearings.  Previous  researchers  have  applied  numerous 
extraneous  assumptions  to  solve  these  equations  and  thus,  their 
results  are  not  accurate.  Additionally,  we  have  presented  the  no- 
siip  conditions  for  ball  bearings  bas^  on  the  kinematic  analysis 
correcting  those  that  have  appeared  in  the  literature  for  many 
years.  Interestingly  eno:  gh,  we  have  shown  that  the  no-slip 
assumption  leads  to  reasonable  results  for  the  low  coefficients  of 
friction  commonly  found  in  ball  bearings. 

Through  our  analysis,  we  have  shown  the  importance  of 
contact  an^e  on  both  ball  motion  and  contact  force.  We  have 
alro  developed  an  approximate  closed-form  solution  and  have 
u^zed  it  in  determining  the  optimal  contact  angle  to  be  used  in 
high-speed  ball  bearings  for  minimum  contact  force.  Finally, 
we  have  used  a  simple  example  to  illustrate  the  usefulness  of  the 
theory. 

This  work,  like  any  other,  has  sonw  shortcomings.  First, 
we  have  not  considered  fluid-film  lubrication  nor  the  effects  of 
frictional  heating  in  the  contact  locations.  Additionally,  we  have 
assumed  Coulomb  friction  whereas  in  reality,  friction  is  a 
complex  function  of  a  number  of  variables.  Finally,  we  have 
assumed  rigid  body  motion  and  have  not  considered  the  effects 
of  contact  deformation.  However,  we  believe  that  the  design 
method  developed  can  be  quite  useful  in  light  of  the  agreement 
between  the  developed  theory  and  previously  published 
experimental  data.  Furthermore,  we  would  hope  that  the  basic 
theory  would  be  extended  in  the  near  future  to  allow 
consideration  of  each  of  the  effects  mentioned. 
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Computer  Aided  Geometric  Design 
of  Motion  Interpoiants 

This  paper  studies  continuous  computational  geometry  of  motion  and  develops  a 
method  for  Computer  Aided  Geometric  Design  {CAGD)  of  motion  interpoiants. 
The  approach  uses  a  mapping  of  spatial  kinematics  to  convert  the  problem  of 
interpolating  displacements  to  that  of  interpolating  points  in  the  space  of  the  map¬ 
ping.  To  facilitate  the  point  interpolation,  the  previously  unorientable  mapping  space 
is  made  orientable.  Methods  are  then  developed  for  designing  spline  curves  in  the 
mapping  space  with  tangent,  curvature  and  torsion  continuities.  The  results  have 
application  in  computer  animation  of  three-dimensional  objects  used  in  computer 
graphics,  computer  vision  and  simulation  of  mechanical  systems. 


1  Introduction 

This  paper  deals  with  computational  geometry  of  motion. 
Kinematicians  and  machine  theorists  have,  for  many  years, 
studied  problems  in  kinematic  geometry  of  machines  and  mo¬ 
tions  (see,  for  example,  Hunt,  1978;  Bottema  and  Roth,  1979; 
and  McCarthy,  1990).  Computer  scientists  and  mathematicians 
have  developed  geometric  algorithms  and  methods  for  prob¬ 
lems  in  computer  graphics  and  computer  aided  design  (see,  for 
example,  Faux  and  Pratt,  1979;  Farin,  1993).  There  has  been, 
however,  no  merging  of  the  two  fields.  This  pa^r  combines 
and  modifies  concepts  from  kinematics  and  from  the  field  of 
Computer  Aided  Geometric  Design  (CAGD)  and_  provides  a 
solution  to  one  of  the  most  basic  problems  in  computational 
geometry  of  motion.  The  problem,  here  referred  to  as  motion 
interpolation,  is  that  of  finding  in-between  displacements  from 
a  set  of  given  displacements  to  provide  a  desired,  possibly, 
smooth,  animation  of  a  rigid  body  for  its  full  range  of  motion. 
In  computer  simulation  of  large  mechanical  systems  (see,  for 
example,  Tsai  and  Haug,  1991),' generation  of  tightly  spaced 
displacements  of  a  moving  body  along  its  trajectory  may  hot 
be  cost  effe^ve.  Instc^d,-a  series  of  displacements^ue  generated 
using  the  dynamic  equations  of  motion  and  then  the  in-between 
displacements  can  be  constmcted  using  appropriate  motion  in¬ 
terpoiants.  In  robotics,  generation  of  Cartesian  trajectories  re¬ 
quires  interpolation  of  specified  target'  configurations^  of  the 
end-effector.' in  computer  vision  and  computer  graphics  (see, 
for  example,  Turner  et  al.,  1991),  reconstructing  motion  of  a 
moving  object  or  a  camera  that  views  the  scene  also  requires 
interpolating  in-between  discrete  images  of  the  object  or  the 
scene. 

There  are  two  basic  issues  in  design  of  motion  interpoiants. 
The  first  one  is  very  basic  to  kinematics  and  concerns  repre¬ 
sentation  of  displacements.  The  second  basic  issue  is  com¬ 
putational  geometric  in  nature  and  is  related  to 
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parameterization  and  piecing  of  motion  interpoiants.  The  tra¬ 
ditional  approach  for  computer  animation  of  three-dimen¬ 
sional  objects  has  separated  the  interpolations  of  translations 
and  rotations  (Reeves,  1981).  Although  the  handling  of  trans¬ 
lation  is  straightforward,  rotation  interpolation  has  proved  to 
be  difficult.  If  orthogonal  rotation  matrices  are  used,  for  ex¬ 
ample,  their  interpolation  is  not  in  general  orthogonal  and 
therefore  does  not  represent  a  rotation.  Early  approaches  have 
involved  independent  interpolation  of  the  Euler  angles.  This 
is  cumbersome  resulting  in  speeding  up  or  slowing  down  the 
motion  and  depends  on  temporal  relationships.  More  recently, 
Shoemake  (1985),  Duff  (1986)  and  Pletinckx  (1989)  used  qua¬ 
ternions  for  aiiimating  rotations.  In’  this  paper  we  extend  these 
works  to  achieve  s&ohd  order  continuity  in  motion  interpo¬ 
lation.  We  also  use  a  kinematic  mapping  of  spatial  kinematics 
to  develop  a  general  framework  for  geometric  design  of  com¬ 
plete  motion  intc^lants  (including  both  translations  and  ro¬ 
tations)!  The  kineihatic  mapping  was  introduced  by  Ravani 
and  Roth  (1984)  for  solving. problems  in  spatial  kinematics 
and  mechanisms  and  is  usjed,  in  this  paper,  to  transform  the 
problem  of  designing  motion  interpoiants  into  that  of  inter¬ 
polating  points  in  a  special  projective  three-space  called  the 
image  space  of  the  mapping.  In  this  manner,  a  piecewise  par¬ 
ametric  motion  is  represented  by  curve  segments  in  the  image 
space.  Continuity  conditions  for  piecing  motion  segments  then 
correspond  to  geometric  continuity  conditions  for  the  corre¬ 
sponding  curve  segments  in  the  image  space.  The  image  space, 
however,  is  not  Euclidean  but  has  an  elliptic  dual  metric.  In 
addition,  it  is  not  orientable  and  as  a  result  point  interpolation 
in  this  space  cannot  be  defined  unambiguously.  We  modify 
the  image  space  so  that  it  is  orientable  and  then  develop  meth¬ 
ods  for  interpolation  and  approximation  in  this  space  with  not 
only  tangent  (first  order)  continuity  but  also  higher  order  con¬ 
tinuities  of  curvature  and  torsion. 

The  outline  of  the  paper  is  as  follows.  We  first  provide  a 
brief  description  for  the  image  space  of  kinematic  mapping. 
We  then  introduce  an  orientable  version  of  the  image  space 
by  replacing  a  point  with  two  coincident  but  oppositely  oriented 
copies.  Section  3  presents  a  linear  interpolation  in  the  image 
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space,  which  corresponds  to  a  screw  motiop  of  the  animated 
body.  Section  4  describes  the  generalization  of  these  linear 
results  to  cubic  interpolation  of  displacements.  Section  5  de¬ 
velops  the  differential  geometry  of  the  image  curves  in  so  far 
as  it  is  needed  for  developing  continuity  conditions  required 
in  design  of  piecewise  parametric  motions.  The  results  of  this 
section  are  then  used  in  Section  6  to  design  cubic  splines  in 
the  image  space  with  tangent  or  first  order  continuity  and  with 
higher  order  continuities  of  curvature  and  torsion. 


2  Displacements  as  Points  in  the  Image  Space 

This  section  first  reviews  the  image  space  of  kinematic  map¬ 
ping  ^avani  and  Roth,  1984)  as  a  geometric  representation 
of  a  spatial  displacement  and  then  extends  the  results  to  the 
representation  of  oriented  displacements. 

2.1  The  Classical  Image  Space.  The  general  displacement 
of  a  rigid  body  in  physical  space  (£^)  is  commonly  represented 
by  a  configuration  Af  =  ( [R] ,  d)  of  a  moving  frame  with  respect 
to  a  fixed  frame,  where  [R]  is  a  3  x  3  rotation  matrix  and 
d=  (dx,  dy,  dz)  is  a  translation  vector.  The  matrix  [R],  when 
written  in  terms  of  the  Euler  parameters  of  the  rotation, 
X=  (Xu  Xi,  X^y  X4)y  is  given  by  (Bottema  and  Roth,.  1979): 


Xl-Xl-'Xj^Xi 

2(XiX2‘^X^4) 

2(XiX2-XyX4) 

^x^+xl-xl^xl 

2(X2X2  +  XiX4) 


2(XiXy^X2X4)  ‘ 
2(X2X2-X,X4)  , 


(1) 


where  A^-Xi+Xi+X^+Xt  The  Euler  parameters  are  de¬ 
fined  by  the  rotation  angle  6  and  the  direction  vector  s=  (Sx, 
Sy,  Sz)  of  the  rotation  axis  as 


Xi=SxSin{e/2)y  A"2=5^sin(0/2), 
Ar3f5xSin(^/2),  Zi=cos(6/2).. 

Since  the  motion  of  a  rigid  body  in  has  six  degrees  of 
freedom,  the  geometry  for  which  a  ^splacement  as  an  element 
is  six-dimehsionid.'This  has  led  to  the  idea  of  using  a  set  of 
six  independent;  parameters  such  ^  (Xx/X^'y  X2/ X^y  X^/ A^, 
d^'  d^  dz)  to  repr^erif  a'  displacement'  as  :a  point  in  a  six¬ 
dimensional  space.  A  well-known  extension  of  this  idea  is  the 
notion  of  “configuration  space’’  in  the  contact  of  robot  motion 
planning  (Lozano-Perez,  1981;  and  Canny,  1986):  ^ 

"  '  More  elegant  representations  of  a  displacement  exik  where 
the  EMef  ^^atnet^rs  X^  arid  the  ti^slation 

y^br  as'definihg  the  direction 

and  location  of  a  line  in  a  space  of  four  dimensions,  respec¬ 
tively.  This  idea  waslfirst- introduced  by  Study  in  1891  who 
develop^  a  geometric  repr^Kentation  for  spa^^  displacements 
in  terms  of  what  is  ndw  referred  tb  as  Sft/jfy  vectdirs:  Study 
vectors  are  a  pair  of  vectoris  (X,'  -•  2X®)  that  generalize  to  four 
dimensions  Plucl^r  vectors  of  line  in  threcjctoensional  space, 
where  Vxf  denotes  ^the  vector  of  Euler  parameters  , and 
'X^,^  X]^y^y  Xty  X^)  ts  glveii  by 


1 

0 

i  - 

'Xj' 

X^I 

0 

1 

X2 

X^ 

~2 
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The  four-dimensional  space  of  lines  defined  by  Study  vectors 
is  called  the  Soma  spacCy  see  Bottema  and  Roth  (1979). 

Ravani  and  Roth  (1984)  introduced  a  more  compact  rep¬ 
resentation  of  the  Soma  space  by  rewriting  the  Study  vectors 
as  a  four-dimensional  vector  of  dual  numbers,  X=  (X\y  Xi, 


Xu  X4)y  where  Xi=Xi’^eXf  (/=  1,  2,  3,  4).  These  dual  num¬ 
bers  were  interpreted  as  a  set  of  homogeneous  coordinates  that 
defines  a  geometric  mapping  of  displacements  into  points  of 
a  projective  space  with  three  dual  dimensions.  This  mathe¬ 
matical  space  is  referred  to  as  the  image  space  of  spatial  kin¬ 
ematics  and  each  point  in  the  space,  called  the  image  pointy 
represents  a  spatial  displacement.  The  image  space  is  an  uno- 
rientable  projective  space,  for  the  points  X  and  -  X  are  con¬ 
sidered  to  be  identical. 

The  parameters  Xiy  Xf  (/=  1,  2,  3,  4)  are  not  independent 
but  must  satisfy  the  fundamental  relation: 

XiX°i+X2X^+XiXt+XX=^0.  (4) 

An  image  point  with  coordinates  satisfying  (4)  is  said  to  have 
normalized  coordinates.  A  general  set  of  four  dual  numbers 
to  be  used  as  homogeneous  coordinates  of  an  image  point  may 
not  satisfy  (4)  and  therefore  must  be  normalized’  A  simple 
normalizing  procedure  is  provided  in  Appendix  A  such  that 
not  only  (4)  is  satisfied  but  the  real  part  has  unit  length  as 
well.  For  the  rest  of  the  paper,  it  is  assumed  that  the  coordinates 
of  an  image  point  are  normalized  and  have  unit  length  unless 
explicitly  stated  otherwise. 

The  image  space  of  Ravani  and  Roth  has  been  shown  to  be 
a  useful  tool  for  analysis,  synthesis,  and  classification  of  spatial 
motions  (Ravani  and  Roth,  1984;  Ge  and  McCarthy,  l^la). 
It  has  also  been  used  for  obtaining  an  explicit  representation 
of  joint-space  obstacles  for  robot  motibn  planning  (Ge  and 
McCarthy,  1990)  andTor  characterizing  functional  constraints 
in  a  manufactured  assembly  (Ge  and  McCarthy,  1991b).  The 
image  space  in.its  original  form,  however,  is  not  quite  suitable 
for  computer  aided  :gebmetric  design  of  motion  interpolants, 
for  it  is  not  orientable.  This  means  that  there  is  no  consistent 
way  of  defining  “left”  or  “right”  handedness  of  a  coordinate 
tetrahedron  for  the  space.  It  also  means  that  there  does  not 
exist  the  notion  of  “betweenness”  and  as  a  result  one  cannot 
define  a  line-segment  uniquely.  These  are  serious  problems  to 
proper  interpolation  of  a  point  set.  In  the  next  section,  we 
present  an  orientable  version  of  this  image  space  to  resolve 
these  problems.  ,  ' 

2.2  The  Orientable  Image  Space.  We  start  the  introduc¬ 
tion  of  the  orientable  image  space  by  revisiting  the  rotation 
matrix  [R].  Given  [R],  the  rotation  angle  6  and  rotation  axis 
s  may  be  extracted  from  its  elements,  see  Bottema^d  Roth 
(1979),  Raul  (1981)  or  Shoemake  (1985)i  Ihere^tis,*  however, 
not  unique.  For  a  given  [R],  there  corresponds- a  pair  of  op¬ 
positely  oriented  rotations,  one  is  about  the  axis  s  with  rotation 
angle  d  and  is  termed  a  “forward”  rotation;  the  other  is  about 
the  axis  -s  with  rotation  angle ;2xt4^  and  is  terined  a  ^‘back- 
wmd” ’rotation. 

In’  view  6f  (2),  we  conclude  that  the^two  bppd  oriented 
rotations  can  be  represented  :by  the  iignbi?  Eider  parameters 
X=  (A^,  Xu  Xiy  X4)  snd 

respectively.  It  follows  that  a  general  configuration  M=  ([R], 
d)  may  be  obtained  by  either  of  two  oppositely  driehted  dis¬ 
placements:  one  is  a  “forward”  displacement  and  consists  of 
a  rotation  X  and  a  translation  d;  the  other  is:a  “backward” 
displacement  and  consists  of  a  rotation  -  X  and  a  translation 
d.  In  view  of  (3),  these  two  ^ppositely  oriented  displacements 
can  be  represented  by  two  sets  of  image  space  coordinates, 
X  and  “X,  respectively.  This  leads  naturally  to  the  notion  of 
signed  homogeneous  coordinates  for  the  image  space.  Instead 
of  identifying  X  and  -  X,  we  treat  them  as  defining  two  distinct 
points  which  occupy  the  same  position  in  the  image  space  but 
with  “opposite  orientations,”  namely  “forward”  and  “back¬ 
ward.”  To  borrow  a  term  from  spherical  geometry,  we  may 
call  these  two  points  antipodal  points.  Thus  a  general  dis¬ 
placement  corresponds  to  not  one  image  point  but  two  anti¬ 
podal  points.  The  image  space  consists  of  such  pairs  of 
coincident  but  oppositely  oriented  points  is  here  referred  to  as 
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the  orientable  image  space  and  is  denoted  jas  E.  In  general,  the 
signed  homogeneous  coordinates  X  and  wX,  where  w=w+ew^ 
is  a  nonpure  dual  number,  are  considered  to  defined  the  same 
point  if  w>0;  they  define  two  antipodal  points  if  w<0. 

Specialists  in  projective  geometry  will  notice  that  geometry 
of  the  orientable  image  space  E  is  equivalent  to  geometry  of 
a  unit  hyperspace  (denoted  by  in  a  space  of  four  dual 
dimensions,  with  oriented  points  and  oriented  lines  in  E  cotT 
responding  to  points  and  oriented  great  circles  on  respec¬ 
tively.  The  classical  image  space  is  topologically  different  from 
E.  Geometry  of  the  classical  image  space  is  identical  to  uno¬ 
riented  spherical  geometry  of  three  dual  dimensions,  for  one 
point  in  the  space  corresponds  to  .  two  diametrically  opposite 
points  on  an  unorientable  unit  dual  hyperspace.. 

The  distance  between  two  points  i  and  ^  in  E  is  in  general 
a  dual  number  and  is  defined  analogous  to  the  angular  distance 
in  elliptic  geometry  of  real  three-space.  The  dual-number  dis¬ 
tance  is  the  dual  angle  i-<t>  +  eh  obtained  from: 


cos^  = 


X*Y 


l/2» 


(5) 


where  X*^ = f i  +  ^2 ^2^+ ^3  ^3  +-^4^4  is  the  standard  scalar 
product.  The  dual  angle  $  is  uniquely  defined,  provided  that 
the  real  angle  0  is  restricted  to  the  range  [0,  x]  (see  the  Appendix 
A).  Kinematically,  the  dual  an^eapediies  tihe  magnitude  of 
a  relative  screw  displacement  between  the  displacements  rep¬ 
resented  by  X  and  Y:  2^  is  the  angle  ofxotationiabout  the 
screw  axis  of  the  relative  displacement  and  2h  is  the  distance 
of  translation  along  the  same'axis.  When  two  iihage  points 
satisfy  X*$  =  0,  the  distance  between  them  is  t/2  and  they  are 
said  to  be  a  pair  of  pb/or  points.  The  corresponding  displace¬ 
ments  are  related  by  a  h^f  turn  (a  jpure  rotation  with  angle 
t). 


2.3  Curves  and  Planes  in  the  Image  Space.  Smee  the  im¬ 
age  space  E  has  three  dual  dimensions,  a  general  curve  in  E  is 
a  one-dual-dimensional  curve  or  a  twofold  curve.,  A  general 
plane  in  E  has  two  dual  dimensions  and  is  here  referred  to  as 
a  twofold  plane.  A  twofold  euWe  is  the  mapping  of  a  twor 
degree-of-freedpm  motion.  A  one-degree-of-freedom  motion 
maps  into  a  special  curve  in  E  called  z  unifold  curve.  A  twofold 
plane  is  the  mapping  of  a  four-degree-oMreedoni  motion.: 

The  simplest  twofold  curve  in  E  is  an  oriented  twofold  straight 
line.  Given  two  driented  points  &  and  jr  the^twofoldiJine 
joining  from  X  to^  ^^  is 'unique;  and  can  be  represented  by  a 
directed  bivector  XA’t.  ^wher^.the  symbol  “A”  denotes  the 
vector  wedge  product  which  generalizes;  to  higher  dimensions 
the  vector  cross  product  of  three-dimensional  vectorialgebra, 
see  Flanders  (1963)  and  McCarthy  (1990).  The:six  components 
of  'k.Ai.  are  the  signed  Plucker  coordinates  of  the  Im^ 
line  joining  from to.Xris  giyen'by^  the  biyect  and.  has 

the  opposite  sense  direction  to;ihe:Iine^A$;  rkinematically,"  a 
twofold  line  Xa$  is  the  mapping  of  a  twOrdpgree-ofrfreedom 
screw  motion  that  contains -the  displacements  represented  by 
i  and:S’^The  screwjnption  consists..of  two.independent  simple 
motions  about  the  fixed  screw  axis;  a  rotation  and  a  translation. 
If  the  translation  is  made  dependent  on  the  rotation,  then  the 
resulting  motion  becomes  a  one-degree-ofrfreedom  screw  mo¬ 
tion  which  maps  into  a  unifold  line  in  E.  r 

A  general  twofold  plane  is  the  mapping  of  a  four-degree- 
of-freedom  line-symmetric  motion.  To  see  this,  we  consider  a 
plane  P  defined  by  three  oriented  points^]X,  If  and  Z.  It  can 
be  represented  by  the  directed  trivector  XaI'a^.  Since  the  E 
space  is  an  oriented  projective  dual  three-space,  a  formal  dual¬ 
ity  exists  between  oriented  points  and  oriented  twofold  plmes. 
It  follows  that  the  four  components,  F=  {Pu  ^3»  A)*  of 
the  trivector  XA'tAZ  define  the  pole  of  the  plane  P.  Any  point 
of  the  plane,  say  G,  satisfies  the  relation  F«G  =  0  and  thus  the 
two  displacements  represented  by  f  and^G  are  related  by  a 
half-turn.  Therefore  the  twofold  plane  XaYaZ  represents  a 


four-degree-of-freedom  line-symmetric  motion  that  contains 
the  displacements  X,  Y  and  Z.  A  planar  twofold  curve  is  the 
mapping  of  a  two-degree-of- freedom  line-symmetric  motion 
and  a  planar  unifold  curve  is  the  mapping  of  a  one-degree-of- 
freedom  line-symmetric  motion. 

The  point  coordinates  of  a  unifold  image  curve  depend  on 
a  real  parameter  A  They  are  denoted  by  W.(/)  =  w(/)X(f) 
where  W(/)  =  (  lKi(0,  lK2(f),^1^3(0,  1^4(0)  represents  the 
nonnormdized  coordinates,  X(0  = 

Jc^it))  represents  the  unit-normalized  coordinates,  and  >v(0 
denotes  the  normalizing  factor  (see  Appendix  B).  If  the  co¬ 
ordinates  ^i{t)  (/=1,  2,  3,  4)  are  specified  bfy  linear,  quad¬ 
ratic,  or  cubic  functions  of  t,  the  corresponding  unifold  image 
curve  is  a  straight  line,  a  conic,  or  a  cubic,  respectively.  The 
resulting  motion  is  termed  /wear,  conic,  of  cubic  motions, 
respectively.  Physically, a  linear  motion  is  a  screw  motion  and 
a  conic  motion  is  a  line-symmetric  motion. 

The  velocity  distribution  of  a  spatial,  motion  is  determined 
by  the  dual  velocity  V,  which  can  be  expressed  in  tenns  of  the 
point  coordinates  of  an  image  curve  X  ( t)  and  its  fost  derivative 
i(/)  using  quaternion  algebra.  Let.:(i,  j,  k,rP  denote  the 
quaternion  basis.  Then  the  dual  velocity  is  given  by  the  qua¬ 
ternion  product:  .-i  ...  ^ 

^  (6) 

where  ]S[=^ii+jf2j is  a  unit  quaternion  and 
-X,i-^2i"-f3k+^4  is  its  inverse. This Tesiilt  is  ob^ 
tained  from  dualization  of  a  sinul^  .fonnuI^Tpr  spherical 
kinematics  presented  in  Bottema  and  Roth  (1979). 


3  Linear  Interpolation  of  Two  Displacements 

Two  displacements  Dq  and  Di  of  a  rigid  body  correspond 
to  two  configurations  Mi  of  a  moving  frame  with 

respect  to  a  fixed  frame,  which  in^tura  correspond  to  two  pairs 
of  oppositely  oriented  points  ±Xo  and  respectively,  in 
the  orientable  image  space  E.  In  this  paper;  we  choose  orien¬ 
tations  (or  signs)  of  the  image  points  Xp  and  X|  such  that 
Xo*Xi  >  0.  GeOmetric^y,  this  means  that  the  angular  distance 
<#»  between  and  is  in  the  range  [0,  t/2].  Two  such  oriented 
points  are  referred  io.as  similarly  oriented poin^^J^i^  vf if ; 


Di  becomes  that  of  findi^  a^iitufold  line-se^ent  th^ 
two  simHarly  priented;^mtsi^:and  ix.:Let:lbe;  unifoid^^ 
segment  be  given  byi^nW^:  ;  ^  ■  ..  : 

'  ^  (7) 

where  %{t)rfX{tX]d^tptpl^e  -and  miit-norin 

.coordinate, rTttpiei^Yeljri^apd^  ^(i)  jdenpt^;^ 
factor.  We.  seek  to detern^Cj^^ 

;i),such  th^ 

The  iMuit  is'  jie  followi^^^  line^  mten>9lant(,‘  .I;’  ' 

where  #0= be  a^itrary  ^uail  hihnb'ers. 
In  practice  we  require  1^0  and  to  have  positive’re^  parts  so 
that  all  displacements  of  the  linear  motion  are  similarly  ori¬ 
ented.  The  normalizing  factor  i^(t)  is  obtained  from  (9)  as 

ivU)  =  [(1  +  (10) 


The  unifold  line-segment  (9)  is  the  image  curve  of  a  screw 
motion  that  interpolates  through  the  configurations^  Mo  and 

Ml. 

To  see  the  effect  of  the  dual-number  factors  Wo  and  h^i  on 


/For  the  sake  of  convenience,  we  sometimes  use  the  words  “displacements*’ 
and  “conngurations**  interchangeably  in  this  paper. 
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the  velocity  distribution  of  the  animated  body,  we  use  (6)  to 
obtain  the  dual  velocity: 


x,Xo-'-^r 


(1 -r)^(wo/w,)  +  +  2r(i  -  0  (Xo'Xi) 


(ID 


where  the  term  ‘)  is  a  vector  quaternion.  Equa- 

tion  (11)  shows  that  the  screw  axis  is  fixed  (as  anticipated  for 
a  screw  motion)  but  in  general  both  the  angular  velocity  and 
the  pitch  of  the  screw  motion  are  functions  of  the  parameter 
t  and  the  ratio  wo/h^i.  The  choice  of  influences  the 

angular  velocity  and  the  pitch  of  the  screw  motion  but  it  is 
impossible  to  choose  the  ratio  such  that  the  pitch  or  angular 
velocity  remains  constant  throughout  the  screw  motion. 


4  Cubic  Interpolation  of  Displacements 

In  the  preceding  section,  we  dealt  with  the  problem  of  two- 
configuration  interpolation  using  a  linear  motion  interpolant. 
In  this  section,  we  develop  cubic  parametric  motions  that  in¬ 
terpolate  through  not  only  two  given  configurations  but  two 
instantaneous  dual  velocities  as  well.  This  is  referred  as  as  first 
order  motion  interpolation. 

4.1'  First  Order  End  Conditions.  In  the  image  space 
the  two  configurations  Mq^  Mx  are  represented  by  two  similarly 
oriented  image  points  Xj ,  respectively.  Let  the  dual  velocity 

at  Afi  {/=0,  1)  be where  2fi/=2(Q;+€Q?)  (Q/^0,  Q, 
®  ^  0)  is  the  dual  speed  and  is  the  unit  dual  vector  representing 
the  instantaneous  screw  axis.  We  define  the  “tangent”'  of  the 
desired  interpolating  motion  at  A/|  '(/=0,  1)  in  terms  of  the 
derivative  of  the  image  curve  X(/)  of  the  motion.  In  view  of 
(6),  the  derivative  of  i(0  at  Mi  (/=0,  1)  is  given  by  the 
quaternion  product: 

where  It  is  not  difficult  to  show  that  t/  satisfies 

T|*t;=  1  and  T/*X/=  0.  Therefore  %  is  a  polar^point  of  X/  and 
can  be  interpreted  as  a  “tangent  vector”  at  X/  with  t/  being 
a  “unit  tangent  vector”  and  fi/  being  its  magnitude. 

The  problem  of  finding  a  motion  that  interpolates  through 
configurations  Mo,  Mi  as  well  as  the  dud  velocities  Vo,  Vi 
becomes  that  of  finding  a  unifold  curve  X(0  in  L  such  that 
the  following  interpolation  conditions  are  satisfied: 

The  condifibns  (12)  have  similarities  to  Hermite  ihtei^lation 
conditions  in  CAGD  and  ate  therefore  termed  /ferm/te  type 
motion  interpolation  conditions.^ 

:i^'4.2  Hermite  Type  Interpolation.  A  unifold  cmbic  para- 
nie^c  cu^e  .in  1^^^  is  of  the  fo^.  ,  _  - 

W(/)  •  (13) 

where,  W;(/)  denotes  general  coordinates  of  a  point  on  the 
cubic,  i(0  denotes  its  nbmalized 'coordinates,' and  de¬ 
notes  the  normalizing  factor.  We  seek  to  determine  the  vectors 
of  coefficients  §,  (/=0,  1,  2,  3)  such  that  the  conditions  (12) 
are  satisfied.  Since  the  first  derivative  +  wX,  the  Her¬ 

mite  conditions  (12)  are  equivalent  to 

,^(0)=i^0)Xo.  W(D=i^DXi, 

W(0)=^(0)S:o+M<0)fioto,  ^(l)=^(l)&,  +  Vv(l)fi,ti.  ^  ' 

From  (13)  and  (14),  the  coefficient  vector  i/  (/=0, 1,  2,  3)  can 
be  solved  to  yield  the  following  cubic  interpolant: 


W(r)  =  [i 


w(0)Xo 

^(0)S:o+w(0)fioto 

»^(DX, 


(15) 


where  [M/,]  is  the  standard  Hermite  basis  matrix  (see  Faux  and 
Pratt,  1979): 


IM,]  = 


10  0  0 
0  10  0 
-3  -2  -1  3 
2  1  1-2 


The  interpolant  (15)  is  here  referred  to  2i  Hermite  type  cubic 
motion  interpolant.  For  a  given  set  of  Hermite  conditions  (12), 
the  corresponding  inten^olant  is  not  unique,  because  the  dual 
numbers  h<0),  w(1),  i^(0),  i^(1)  can  be  arbitrary  chosen.  In 
practice  we  require  w(0)  and  >^1)  to  have  positive  r^  parts. 

The  Hermite  type  interpolant  (15)  can  be  rewritten  as 


;.'«^(0  =  w(0X(D  =  [l//VllA/«] 


»^l)X, 


(16) 


where  the  matrix  [Mff],  called  the  generalized  Hermite  basis 
matrix,  is  given  by 


1  0  0  0 

70  10  0 

-3-270  -2  -1  3-71 
2  +  70  1  1-2+71 


(17) 


where  7o-ii<0)/vK0)  and  7i  =  ikl)/^l)  arbitrary  dual 
numbers. 


^  4.3  Bemstein-Kzier  Type  Interpolation.  Let  6o  =  Xo, 
b3=Xi.  Then  the  Hermite  interpolation  (16)  can  be  put  into 
the  following  form: 


W(/)  =  H-(0X(0  =  [1 


w(0)6o  ' 
i^O)fio6i 
H<l)fi,62 
vKDfii 


where  the  matrix  [Mg]  is  given  by 


(18) 


X' 


1  0  0  0 
-3+70  .3  0  0' 

3-270  -6  3  -7i 

-1+70  -3  ‘:-3  1  +  71 


and  fijr  =  io +5^0/3,  62  =Xi.-  fi/3  me  points  on  Ae  tangent 
lines  XoAto,  HiAt x ,  respectively.  The  nrntrix  [M^J  .becomes 
the  standard  Bemstein-BiSzier  basis  matra  whieii  70=70 =0- 


1 

0 

0 

0 

-3 

3 

0.. 

0 

3 

-6 

3 

0 

-1 

3 

-.3 

1 

Therefore,  the  matrix  [Mjj]  is  termed  generalized  cubic  Bern- 
stein-Bizier  basis  matrix  and  the  interpolant  .(18)  is  termed 
generalized  Bernstein-Bizier  type  motion  interpolant. 

The  generalized  Bernstein-Bezier  interpolant  can  be  rewrit¬ 
ten  as: 


W(0  =  >S>(f)X(0  =  [l 


ivobo 

w-,bi 

W,b2 

^363 


(20) 


where  [Mj,]  is  given  by  (19)  and 
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*i»o  =  »<>(0) ,  »vi  = 

H’2=»^l)Slg|,  W3=M>(1), 

b,  =  (b;  +(V3)bo]/|o.  lo=  lb;  +  (V3)bol. 

b:  =  (62  -  (y,/3)63]/|o,  li  =162+  (V3)b3 1  • 

Note  that  61,  62  are  arbitrary  points  on  the  tangents  boAbi", 
62  Abj,  respectively.  Note  also  that  the  Bezier  interpolant  (20) 
is  invariant  to  change  of  both  fixed  and  moving  coordinate 
frames. 


5  Differential  Properties  and  Smoothness  Conditions 

In  this  section,  we  first  summarize,  in  the  language  of  ori- 
entable  image  space  E,  the  differential  properties  of  image 
curves,  which  were  developed  by  McCarthy  and  Ravani  (1986) 
for  the  case  of  the  classical  image  space.  We  then  apply  the 
results  to  the  problem  of  smooth  joining  of  two  motions  seg¬ 
ments. 


5.1  Differential  Properties.  The  differential  geometry  of 
curves  in  E  is  developed  analogous  to  the  differential  geometry 
of  curves  in  Euclidean  three-space.  Let  i(/). denote  the  signed 
unit-normalized  coordinates  of  an,  image  curve.  The  funda¬ 
mental  result  is  the  Frenet  equations  wWch  characterize  the 
curve  in  terms  of  the  differential  motion  of  a  special  tetra- 
h^ron  with  oriented  points  t,  jfi,  ft,  X  ^  its  verticil: 


dfl/ds-r  ~Kt+fft, 
d6/^=-TK 


The  dual  numbers  5,  jc  and  f  are  the  dual  arc-length,  curvature 
and  torsion  of  X(0,  respectively.  The  vertex  T  is  .defined  in  • 
the  same  way  as  the  unit  tangent  vector  of  a  Euclidean  curve: 


dX/dt 

dS~  \d%/dt\" 


(22) 


The  vertices  N  and  ft  are  defined  in  such  a  way  that  the  four 
points  X,  ft  form  a  self-polar  tetrahedron  called  the 
Frenet  /e/rj/iedron.^nie  manner  of  defining  N,  6  is:similar  to 
that  of  the  normal  and  binormal  vectors  of  acurve  in  Euclidean 
three-space,  see  tioCanno  (1976)  2uid  McCarthy  (1987). 

The  tangent  lirie-scgmentu)f  i(/)  is  ^ven  by  the  following 
bivector: 


:  .  XaX=^^,  .  (23) 

where  v=  Icfi/d^l.-and^l*  is  the  pojm.point  defined  by.  (22). 
Yhe  cui^atine  fim^on  Md  ttie  jbs^  pime  me  giyeri  by 
the  following  S^yxtbr:  : :  ^  ■ 

^  -  ■:h  ■  ■ 

The  torsion  prope^  IS  .  given  by  ^ 

XAXAX'Ai=<5®ic^fjCAtASA6.  (25) 

The  wedge  prddqc^  of  the  above  derivatives  can  be  eraressed 
in  terms  of  the  nbnnqrmalized  coordinates  W(/)  =  ivX(0- 


iAtAX  =  (1/#)WaWaW, 
Xt\1LkkA% = (i/iv^)  wa^ai^aW. 


5.2  Smoothness  Conditions.  Consider  two  cubic  curve 
segments  X::  and  X+  that  are  smoothly  joined  together.  We 
may  think  of  each  segment  as  existing  by  itself,  with  local 
parameters  /+,  respectively,  defined  over  the  interval  [0, 
1].  We  may  also  think  of  the  two  as  two  segments  of  one 
composite  curve,  with  a  global  parameter  u  defined  over  the 
interval  [w_,  u+].  The  "‘left”  segment  X_  is  defined  over  [w-, 
Wo],  while  the  “right”  segment  X+  is  defined  over  [uq,  u+]. 


Then  the  local  parameters  r_,  are  related  to  the  global 
parameter  u  by 

I  U-U_  ^  U-Uq  U-Uq 

Uq-U.  U^-Uq  A  + 

The  differential  operators  for  local  and  global  parameters  t 
and  u  are  related  by 

du  Adt’  diP:  dP  dP'  diP  A^  dp.' 

In  what  follows,  we  present  smoothness  conditions  of  a  com¬ 
posite  cubic  Hermite  image  curve  in  terms  of  the  continuities 
of  the  tangent  vector,  curvature  and  torsion.  We  always  assume 
that  the  two  given  segments  are  continuous  at  the  junction 
point,  i.e.,  X-(l)  =  X+(0). 

The  continuity  of  the  tangent  vector  at  u=Uq  of  the  two 
curve  segments  X..,  requires  that  the  first  derivative  with 
respect  to  a  global  parameter  u  is  continuous  at  the  junction 
point:  .  . 


dUu) 

■kJ 

1 

1 

1  X.(r.) 

du 

A_  dt. 

“=“0 

This  is  equivalent  to 

X_(i)Ai.(i)=oX+(d)Ai+(6) (27) 

where  ce= A-./A+.  If  a  is  any  other  dual  dumber  with  pq§itive 
real  part,  then  (27)  repr^ents  the  continmty  pXunit  tangent 
■vectors.  ‘ 

For  curvature  continuity,  in  addition  to  (27),  the  curvature 

3- vector  is  required  to.  be  continuous:  s  j-.: 

Xl(l)Ai:.(l)AX_(l)  =  a^jC+(0)Ai+(())AX*(0).  (28) 

For  torsion  continuity,  in  addition  to  (27)  and  (28),  the  torsion 

4-  vector  is  also  required  to  be  continuous: 

x:.(1)aX_(1)aX.(1)aX_(1) 

=  a^+(0)Ai+(0)Ai+(0)AX+(0).  (29) 


5.3  Cubic  Parametric  Curves.  Given  an  image  curve  as¬ 
sociated  with  a  cubic  interpolation  of  four  distinct  displacer 
ments,  we  can  obtain  the- differential  properties Jn.  terms  of 
control  points  and  weight  factors  and  then  use  th^e  properties 
to  characterize  the  smoo|^^.of  the  resulting  motion.  In  what 
follows  we  only  pro\ide"the^  properties  at  end  points  of  the 
cubic  Hermite  type  interpolation  given  by  (16). 

The  talent  Ihae-segnients  at  the  end  points  of  the  Hermite 
type  image  curve  (16)  ^e  given  .by  ?  .  ,  , 

'  k(0)AX(d)=:fio:fcoAto;4^^V<r ,  (30) 

v<  -:oi  :  ;  '  &(l)Ai(l)=fi,XiAti: :  ;  .  (31) 

Let  wo=  Vi<0)  and  iivj  =  MX)'*  Tlie  curvature'  properties  afthe 
end  points  aregiven  by  1,5^  "  ^ 


i(q)AX(b)A;|(d) = 6^ 

.  ^  . .  ■,;'(32) 

X(i)Ai(iVx(i)=6^, 


(33) 


The  torsion  properties  at  the  end  points  are  given  by 
.  w? 

i(0)AX(0)AX(0)AX(0)  =  12-5fiofilXoAtoAX,At,, 
Wo 


X(l)AX(l)AX(l)AX(l)=l2^fiofiiXoAToAX,Ati. 

Wo 

Let  X,i,  fi_  it^  1,  Xo»  and  fio^o  denote  the  four  control  points 
of  the  “left”  cubic  segment  X_(t_),  and  Xo,  SoTo,  Xi,  and 
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Qiti  denote  those  of  the  “right”  cubic  segment  X+  {t+ ).  Then 
the  substitution  of  (30)  and  (31)  into  (27)  yields  a=l.  The 
curvature  continuity  condition  is  given  by 


^l+^^X-,+jfi-|t.,jAXoAto 


=  V^iXo^ToA  ~  -  QiT  1  + 


Note  that  the  weight  Wq  and  the  dual  speed  fio  liave  no  influence 
on  the  curvature  continuity  at  The  condition  (34)  can  be 
further  reduced  to 

(fiiti  +  Zo)A^to=0.  (35) 

where 

2o=%^fi-it-,+^(3+Y-,)i-,-(3-7*,)«..  (36) 

W|  Wi 

Equation  (35)  indicates  that  the  three  points  (fiiTi  +  2o), 
and  I'o  are  collinear.  In  other  words,  (fiit'i  +  Zo)  is  expressible 
as  a  linear  combination  of  &o.  and  tq^ 

+  (37) 

where  are  two  arbitrary  dual  numbers. 

The  condition  for  torsion  continuity  at  Hq  is  given  by 

(38) 

Note  that  the  weight  the  dual  spoed  ^haVe  no  influence 

on  the  torsion  continuity  at  Xq.  Eliminate  Ti  from  (38)  using 
(34)  to  obtain 

C_  1  ( w_  iX_  1  +  WiXi)A'1'_ jAXoATq  . 

=  Wi(3  +  7^,)X_iAXiAXoAto.  (39) 


6  A  Piecewise  Cubic  Hermite  Interpolation 

This  section  solves  the  following  problem: 

Given:  Image  points  Xo,  Xi,  •  •  •,  X/.^i  representing  L  con¬ 
figurations  of  a  rigid  body  in  physical,  space. 

Find:  A  piecewise  cubic  motion  of  the  object  that  passes 
through  the.  given  configurations  such  that  the  corresponding 
image  ctirve  has  curvature  and  torsion  continuities: 

The  given  image  points  are  chosen  as  jiihction  Henmte  points 
of  the  piecewse’irubic  Intei^  The  eoal  is  to  determne 
a  tangent  Vi^or  fi/t/  at  each  image  point  X//  where  fi/doibtes 
half  of  the  dual  sp^  and  t/  denotes  the  polar  point,  such 
that  the  resulting  interpolation,  has  the  prescribed  continuities. 

Consider  three  cubic  segments  with  the  junction  points  de¬ 
noted  by  X/_i,  i/,  &/+1,  and  X/+2*  where  /=  1,2,  •  -  •,  £-3. 
Assume  that  the  tangent  vectors  i ,  fi/T/  are  known  from 

the  continuity  conchtidns  bf 'the  preceding  cubic  segments.  In 
what  follows  we  develop  a  method  foi*  determining  the  tangent 
vector  fiz+itz+Tsuch  that  these  cubic  pieces  satisfy  the  con¬ 
ditions  for  curvature  continuity  at  X,-  and  torsion  continuity 
kt  Xj^hi*  is  the  essential  part  of  th^^  algorithm 

for  geherating  cubiC'Herimte  interim  spline  motions. . 

Position  knd  tangent  continuiti^  are  implied  by  the  Heir^te 
interpolation.  In  view  of  (36)  and  (37),  the  curvature  continuity 
at  X/ is  given  by 

=  (40) 

where 


(3  +  7,-  ,)X,-. ,  -  (3  -  7,>  ,)X,> 

W/+I  W/+1 


The  choice  of  C;,  c/  must  be  such  that  t/+i  is  a  polar  point  of 
X/+1  and  that  the  cubic  segments  satisfy  torsion  continuity 
condition  at  X/+ 1. 

The  point  T/+ 1  is  a  polar  point  of  X/+ 1  if  T/+ 1  ‘X^.  i  =  0  which 
yields,  after  the  substitution  of  (40): 


(41) 


In  view  of  (39),  the  condition  for  torsion  continuity  at  X/+ 1  is 
given  by 

n/(w,X,+ W/.,2X/+2)At/AX/+iAT^^ 

.=.w,>2(3:+7/)XA^  (42) 

After  the  substitution  of  (40),  Eq.  (42)  becomes 

[fiA-  (3  +  7/)c;iiiAt>iAX,^2At,  =  0,-.  (43) 

where 

0, = fi-  A^f+iAtiAi, 

^■(3+7Di,Ai;.Ai7^2A2,-.  (44) 

Equation  (43)  is  essentially  a  scalar  equation,  and  together 
with  (41),  we  have  a  system  of  two  dual-number  equations 
with  two  dual-number  imtaiowns,  namely,  q  and  cl.  Once  § 
and  c/  are  solved,  Eq/ (41))  can  be ’used  to  determine  and 

With  the  above  method,  one  can  generate  thCitangeht  vectors 
fi.f 2, 3i  - >  ^2)  such  that^ih^  ir^ultingpibc^^edibic 

Hemufe  image  curve  has^curvatufe  continui^a^l,  torsion 
contihui^  at  i^^  and  duiy^to  at 

the  int^eeh^ints; ,^:|3)i^|(Wd^  that 
the  fireY  two  t^geht  v^ors 

left  with  tasks  of  spedfe^g  the  end  tohditipiK:’ one  is  to 


select^to^  fii^such  that  the  residtmg  M^^ 
cohtihuotiS'kRXi;  ^d  til?  other  ¥  tangent 

vectOT  Ol-  such  that  the  image"  cmv?^^ 
tinuous. at  Xt-2-''' 

Tobbtain  the  fir^st  two  t^gent  vectors  fioTo^SiTi*  we  sp«:ify 
two  unit  dual  vectors  Vo/ that  represent  the  instantaneous 
screw  axes  'at^'configurations  Xo,  Xi,  respectively,  and  then 
compute  to,  Ti  from  the  following  quaternion  product: 


to=(l/2)voio,  t,  =  (l/2)v,5i.. 

To  achieve  torsion  continuity  at  Xj,  we  obtain  the  dual  speed 
fio  "  . 

A  ^2(3  7o)  ^  XoA!X[2A]S!!i  aT  1 1 

in  view  of  (39).  the  dual  spbed'Or'can  be  arbitrary  chosen.  To 
pb&n  the  fast  ^gent  vectors,  we*  spedfy  an  addtional  image 
point  kud  then  compute  fijr  ^  ,t£_'i-Yismg  (41)  and  (43)  so 
that  curvatur?  continuity  at  am  be  achieved.  It  should 
be  noted  that  the  path  of  the  interpolating  motion  is  very 
sensitive  to  the  choice  of  these  end  conditions.  Further  research 
is  needed  to  obtain  a  set  of  “optimal”  end  cohditionSj;sijch 
as  those  that  would  produce  not  o^y  smoo^  but  alsb  naturi^y 
looking  motions.  -  ^  ^  ^  •  • 

.  The  weights  Tf^/‘  (/=  -1,  0,  L-1)  have. pbsitiye  real 

parts  and  can  be  set  to  iv;=  1  initiallyi>The  weights  7/  (/=  - 1, 
0,  •  •-  ,  Z;-l)^are  arbitrary  dual  number^  and;  can  be  set  to 
7/=  0  initially.^  These  parameters  can  be  adjusted  to  finertune 
the  interpolating  motion.  Figure  1  shows  a  set  of  seven  input 
configurations  and  Fig.  2  shows  a  piecewise  cubic  Hermite 
motion  with  cmrvature  and  torsion  continuities. 


Conclusions 

In  this  paper  we  have  presented  a  new  approach  for  smooth 
motion  interpolation  based  on  an  orientable  kinematic  map¬ 
ping.  The  mapping  reduces  the  problem  of  motion  interpo¬ 
lation  to  that  of  designing  an  interpolating  curve  in  an  orientable 
projective  dual  three-space.  Methods  for  cubic  interpolation 
and  piecewise  cubic  interpolation  of  displacements  have  been 
developed  taking  advantage  of  existing  techniques  in  CAGD. 
The  results,  in  addition  to  their  theoretical  interest  in  com- 
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Fig.  1  A  set  of  seven  configurations  of  an  object 


Fig.  2  A  piecewise  cubic  Hermite  Interpolating  motion  with  curvature 
and  torsion  continuities 


putational  geometry  of  motion,  have  a  number  of  applications 
in  engineering  and  computer  science. 
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APPENDIX  A 


Dual  Angle 

Let  X  =  X + eX®  and  = Y + e'V®  denote  the  uiut>nbnnalized 
coordinates  of  two  image  points.  Then  the  dual  distance 
^ ^ + «/i  (where  <t> = [0,  t])  between  them  may  be  symbolicaUy 
given  by  ^  =  cos“'(X«i).  If  X?«Y,  then  we  have 

cos" ‘(X  •’t) = cos  ■ '(X .  Y)  -  epc  •  Y® + X®.  Y)/(l  ^  X .  Y)'^, 

in  the  sp^al  case  when  X = Y,  the  displarements  X  ^(J^are 
related  by-a  pure  tramlation  of  distance -2 ll^.~.‘T-X®X-rM, 
where  lY^  denotes  the  magmtude  of  the  vector 

quaternion  (Y®Y  ;:'  X®X“  *).  Therefore  we  ^may  define 
cos-‘(X.X)as  " 

-  -  cos"‘(X.X)  =  elY®Y"‘-X?X"M;..  v  ^ 
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A  PJP  E  N  D  I 

Normalization 

^  General  image  space  coordinates,  = w + may  be  nor¬ 

malized  such  that  the  correspondihg  nonrialized  <»or(^at^, 
X  +  eX®,  satisfy  X*X  ==  1  and  X»X® = 0,  This  involves  com¬ 
puting  the  norm,  iv-  w+ew®,  of 

w=(W.W)'^, 

and  follows  by  dividing  ^  with  w: 

X=:W/>v=(JKi/w,  ITj/iv,  1^4/w), 

Division  of  a  dual  number  by  vv  is  possible  and 

unambiguous  if  w 7^0  (Bottema  and  Roth,  1979): 

0  —  -j-  g - - 
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Geometric  Construction  of  Bezier 
Motions 

This  paper  deals  with  discrete  computational  geometry  of  motion.  It  combines 
concepts  from  the  fields  of  kinematics  and  computer  aided  geometric  design  and 
develops  a  computational  geometric  framework  for  geometric  construction  ofrno- 
tions  useful  in  mechanical  systems  animation,  robot  trajectory  planning  and  key 
framing  in  computer  graphics.  In  particular,  screw  motion  interpolants  are  used  in 
conjunction  with  deCasteljau-type  methods  to  construct  Bezier  motions.  The  prop¬ 
erties  of  the  resulting  Bezier  motions  are  studied  and  it  is  shown  that  the  Bezier 
motions  obtained  by  application  of  the  deCasteljau  construction  are  not,  in  general, 
of  polynomial  type  and  do  not  possess  the  useful  subdivision  property  of  Bernstein- 
Bdzier  curves.  An  alternative  form  of  deCasteljau  algorithm  is  presented  that  results 
in  Bezier  motions  with  subdivision  property  and  Bernstein  basis  function.  The  results 
are  illustrated  by  examples. 


Introduction 

One  of  the  basic  problems  in  computer  aided  animation  of 
three-dimensional  objects  is  that  of  motion  approximation. 
This  is  the  problem  of  finding  a  motion  that  approxiinat^  a 
sequence  of  arbitrary  displacements  or  configurations  of  an 
object.  If  the  motion  is  made  to  interpolate  through  these 
configurations  (called  key  configurations),  then  the  problem 
of  motion  approximation  becomes  that  of  motion  interpola¬ 
tion.  Such  motion  approximation  methods  are  of  fundamental 
importance  in  computer  animation  of  three-dimensional  ob¬ 
jects  and  have  applications  in  computer  graphics,  wmputer 
vision  and  simulation  of  mechanical  systems. 

Our  work,  in  this  paper,  is  motivated  by  the  work  by  Taylor 
(1979)  and  Shoemake  (1985)  who  used  quaternions  to  develop 
methods  for  animating  rotations.  Taylor  (1979)  used  quater¬ 
nions  for  linear  interpolations  of  rotations.  Shoemake  (1985) 
interpreted  quaternions  as  defining  a  hypersphere  and  used  the 
deC^teljau  algorithm  to  obtain  the  so-called  spherical  B^ier 
curves  for  animating  rotations.  The  work  of  Shoemake  was 
further  refined  and  extended  by  Duff  (1986)  at  AT&T,  and 
Pletinckx  (1989).  These  researchers,  however,  did  not  study 
some  of  the  fundamental  properties  of  the  resulting  motions. 
Shoemake  (1985)  raised  several  unanswered  questions  on  the 
characteristics  of  the  resulting  spherical  Bezier  curves.  Here 
we  develop  a  theoretical  framework  that  enables  us  to  study 
the  basic  characteristics  of  the  resulting  motion  and  provide 
answers  to  some  of  these  questions.  We  show,  for  example, 
that  a  motion  obtained  by  application  of  deCasteljau  con¬ 
struction  is  not  in  general  of  Bernstein  form  and  is  non^ge- 
braic.  Furthermore,  such  a  motion  does  not  have  the  subdivision 
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property  of  Bemstein-Bezier  curves.  We  then  present  a  mod¬ 
ification  of  this  algorithm  that  results  in  Bezier  motions  with 
subdivision  property  and  basis  functions  which  are  of  Bernstein 
form.  This  paper  compliments  the  analytical  results  presented 
in  our  companion  paper  (Ge  and  Ravani,  1994)  in  providing 
discrete  (rather  than  continuous)  computational  algorithms  for 
motion  interpolation  and  approximation.  ^ 

The  organization  of  the  paper  is  as  follows.  First  we  discuss 
geometric  representation  of  spatial  ^splacements  in  terms  of 
an  image  space  of  a  kinematic  mapping  (see  Ravam  and  Roth, 
1984).  Section  2  deals  with  the  simplest  form  of  inotion  in- 
teipolation  namely  that  of  constant  speed  screw  motions.  Sec¬ 
tion  3  presents  a  deCasteljau  construction  algorithm  bas^  on 
repeated  screw-motion  interpolations  for  the  design  of  non- 
algebrmc  B&der  motions.  In  this  section,  we  also  study  the 
kinematic  properties  of  the  resulting  B6zier  motions.  In  Section 
4,  we  will  present  a  slightly  modified  form  of  the  deCasteljau 
algorithm  using  special  projection  that  results  in  B6zier  motions 
which  are  of  Bernstein  form. 

1  Geometric  Representation  of  Displacements 

It  is  well  known  that  a  general  displacement  in  a  Euclidean 
three-space  (denoted  by^E^)  has  a  fixed  line,  call  ^e  screw 
axis.  The  position  and  direction  of  the  screw  axis  in  E  remains 
the  same  before  and  after  the  displacement.  The  sense  of  ^- 
rection  of  the  axis  is,  however,  undetermined.  The  spatial 
displacement  is  commonly  characterized  as  an  unoriented  screw 
displacement,  which  is  a  rotation  about  and  a  translation  along 
an  undirected  screw  axis. 

A  screw  axis  with  a  uniquely  defined  sense  of  direction  is 
called  a  directed  screw  axis  or  a  spear  (see  Bottema  and  Roth, 
1979).  A  screw  displacement  about  a  directed  screw  axis  is  an 
oriented  screw  displacement.  Two  screw  displacements  are  con¬ 
sidered  to  be  “oppositely  oriented”  if  their  screw  axes  occupy 
the  same  position  in  space  but  with  opposite  sense  of  direction. 
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Fig.  1  Three  configurations  (Afo,  and  belonging  to  a  motion  of 
body  M  in  physical  space  F.  Lines  Sq,  and  S2  denote,  respectively, 
the  screw  axes  for  displacements  and  F-M2. 


In  this  way,  a  general  displacement  can  be  reduced  to  one  of 
the  two  oppositely  oriented  screw  displacements.  These  two 
displacements  are  geometrically  equivalent  but  topologically 
different;  one  may  be  called  a  “forward**  screw  displacement 
and  the  other  may  be  called  a  “backward**  screw  displacement. 

The  orientation  of  a  screw  displacement  is  not  an  issue  when 
only  discrete  displacements  are  considered.  When  a  set  of  screw 
displacements  are  used  to  construct  a  continuous  motion,  how¬ 
ever,  one  has  to  pay  special  attention  to  their  orientation. 
Consider  a  motion  of  a  rigid  body  in  the  Euclidean  three-space 
We  attach  coordinate  frames  M  and  F  to  the  body  and 
the  space,  respectively.  Then  each  of  the  configurations  that 
the  body  attains  during  the  motion  is  represented  by  a  screw 
displacement  from  F  to  M,  see  Fig.  1 .  To  reconstruct  a  smooth 
and  natural  motion  from  these  screw  displacements,  one  has 
to  choose  the  sense  of  direction  of  the  corresponding  screw 
axes  consistently  throughout  the  motion.-  . 

Algebraically,  an  undirected  screw  axis  5. is  commonly  rep¬ 
resented  by  a  pair  of  normalized  Plucker  vectors  (s,  s^  where 
s=  (5jf,  Sy,  Sz)  is  a  unit  vector  along  S  and  So=  5°)  is 

the  moment  of  s  about  the  origin  of  a  fix’ed  reference  frame 
in  They  must  satisfy  the  condition  s*s®=0.  If  the  dual¬ 
number  unit  e  (defined  by  =  0)  is  used,  Plucker  vectors  can 
be  written  as  a  dual  vector  or  a  vector  of  dual  numbers: 

s  =  s  +  es°=(4,^^,^.)  0) 

where  f, = S/ + esf  ( / = x,  >^,  z )  are  dual  ntmibers.  The  dual  vector 
§  has  the  property  §♦§=  1  and  is  termed  a  unit  line  vector, 

A  directed  screw  axis  S  can  be  represented  algebraicaUy  by 
a  directed  unit  line  veaor.  By  this  we  mean  that  §  and  -s 
represent  a  pair  of  oppositely  directed  screw  axes  instead  of 
the  same  axis.  The  use  of  directed  screw  axis  instead  of  un¬ 
directed  screw  axis  removes  the  ambiguity  in  defining  the  mag¬ 
nitude  of  a  screw  displacement.  The  magnitude  is  defined  in 
terms  of  the  dual  angle  8  =  0+6/  between  two  directed  lines 
mi,  m2  that  are  perpendicular  to  the  directed  screw  axis  S, 
where  6  is  the  angle  from  mi  to  m2  about  S,  according  to  the 
right-hand  saew  rule,  and  /  is  the  signed  distance  from  mj  to 
mj  along  S.  For  a  pair  of  oppositely  oriented  but  g^metricsdly 
equivalent  screw  displacements,  their  dual  angles  6  and  6'  are 
related  by  the  ange  2x,  i.e.,  §  +  §'  =2x.  Thus  a  spatial  dis¬ 
placement  may  be  represented  either  by  a  forward  screw  dis¬ 
placement  about  the  screw  axis  s  with  a  dual  angle  §  or  by  a 
backward  screw  displacement  about  the  screw  axis  -§  with  a 
dual  angle  2t  -  §. 

Although  a  directed  screw  axis  s  and  a  dual  angle  6  com¬ 
pletely  prescribe  an  oriented  screw  displacement,  they  are  sel- 
domly  used  directly  as  a  representation  of  the  displacenient. 
Instead,  they  are  often  used  to  define  the  dual  Euler  param- 
eterSy  denoted  by  X=  {^1,  X2,  X^y  -^4),  as  (see  Bottema  and 
Roth,  1979): 
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Fig.  2  A  screw  motion  M{()  from  Mq  to  corresponds  to  a  line-segment 

(or  geodesic)  Xoi(0  from  I  to  In  £. 


Xx  =  f ,  sin  (§/2),  X2 = 52  sin  (§/2), 

Xi  =  Si%\n(d/2),  Xt  =  cos0/2),  (2) 

where 

sin  0/1)  =  sin  {B/2)  -He  ( l/l)  cos  (^/2), 
cos  (§/2)  =  cos  {6/2)-  €  (//2)  sin  {6/2), 

These  parameters  are  considered  to  be  signedy  for  a  backward 
screw  displacement,  prescribed  by  -s  and  2t-§,  corresponds 
to  the  dual  Euler  parameters  -  X  =  ( -^2»  --^3,-^4). 

In  this  paper,  we  consider  the  signed  dual  Euler  parameters 
as  a  set  of  four  signed  homogeneous  coordinates  that  define 
an  oriented  point  in  a  projective  space  with  three  dual  dimen¬ 
sions.  By  this  we  mean  that  the  dual  Euler  parameters 
X=  (Xi,X2, Xi,  X^)  and  iiX  =  ( ivA-,,  w;?4).  where 

iv=w -Hew®  is  a  nonpure  dual  number,  represent  one  and  the 
same  point  for  all  w>0,  and  that,  they  represent  a  pair  of 
oppositely  oriented  points  for  all  w<0.  In  particular,  the  two 
points,  X  and  -  X,  known  as  antipodal  points,  are  considered 
to  be  two  distinct  points  which  occupy  the  same  position  in 
the  space  but  with  opposite  orientations.  In  this  way,  a  general 
displacement,  which  is  geometrically  equivalent  to  a  pair  of 
oppositely  oriented  screw  displacements,  corresponds  to  not 
one  point  but  two  antipodal  points  in  this  projective  dual  three- 
space.  The  resulting  image  space  of  spatial  displacenients  is 
the  orientable  version  of  the  kinematic  mapping  discussed  by 
Ravani  and  Roth  (1984)  and  is  described  in  more  detail  in  Ge 
and  Ravani  (1994). 


2  Geometric  Construction  of  Screw  Motions 
This  section  presents  a  method  for  geometric  construction 
of  an  interpolating  screw  motion  that  enables  the  designer  to 
control  the  speed  of  the  interpolation.  This  forms  the  basis 
for  developing  a  deCasteljau  construction  algoritlun  for  mo¬ 
tion  interpolation  and  approximation. 

2.1  Starting  From  a  Reference  Configuration.  A  finite 
displacement  of  a  rigid  body  M  from  an  initial  configuration 
Mq  to  the  next  configuration  A/j  is  shown  in  Fig.  2.  For  the 
time  being,  we  choose  the  fixed  reference  frame  F  to  be  coin¬ 
cident  with  Mq.  Then  the  configuration  Mo  is  represented  by 
the  identity  displacement  F— Mo  which  corresponds  to  a  specisd 
point  I  =  (0,  0,  0,  1),  called  the  identity  pointy  in  the  image 
space  E.  The  next  configuration  Mi  with  respect  to  Fis  rep¬ 
resented  by  a  forward  screw  displacement  F— Mi-  Its  screw 
axis,  Soi,  is  represented  by  a  directed  unit  line  vector 
Soi=  (Sxy  Syy  Sz)  and  its  magnitude  is  given  by  the  dual  angle 
2^  =  2</)  -H  €(2/i  ) .  These  screw  parameters  define  the  image  point, 
qgi,  of  the  forward  screw  displacement  F—Mj: 

Qoi  =  {SxSin^y  SySin^,  ijsin^,  cos$). 
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Now  the  problem  of  determining  the  interpolMing  screw 
motion  M(/)  becomes  that  of  determining  a  ““fold  hne-seg- 
ment  (or  geodesic)  Xoi(/)  in  E  such  that  Xo,(0)=I  and 
jL„fn=afti,  see  Fig.  2.  The  Pluckier  coordinates  of  this  line  is 
given  by  the  vector  wedge  product  lAqoi=  (^xSin<^,  ^sin^, 
f  sin$,  0,  0,  0).  This  indicates  that  the  dual  vector  Soi  -  (s*, 
L  s,)  represents  the  direction  of  the  Une-segment  Xoi(0-  In 
view  of  the  additional  fact  that  the  length  of  the  hne  segment 
is  the  angle  we  obtain  the  following  parametric  equation 
for  the  line-segment  (or  geodesic)  from  I  to 

XmU)  =  (4sin  (u ( 1  +€p)^),  ^>sin  (w ( 1  +  ep)4>), 

fjSin  («(1-I-€J7)^),  cos  («(1  +  €P)^)).  (3) 

where  M=«(f)  is  a  r^-Valu<id..timng  function  defined  such 
that  w(0)=0  and  «(!)= 1;  and-p=p(0  is  another  red  valued 

function  defined  such  that  P(0)=JK1)=P.  The  function  «(0 
allows  control  of  angiilm^  velocity  of  the  resulting  serqw  motion 
andp(/)  allows  variation  bfthepitch,p(/)  +h/4>,  of  the  screw 
motion.  In  this  paper  we  choose  u(0 and  /i(0  -  0  so  tha 
the  screw  motion  defined  by  (3)  has  not  only  a  consmt  angular 
velocity  but  also  a  constant  pitch.  In  this  case,  (3)  becomes 

-  ■  (4) 


Xoi(0  =  (4sin  (r^),  4si“ 

and  is  called  a  uniform-speed  screw  interpolant} 

Note  that  in  representing  the  configuration  Mi  with  respect 
to  F.  the  choice  of  a  backward  screw  displacement  -  qoi  msteacl 
of  the  forward  displacement  §or  faults  in  a  serw  interpolant 
( t)  that  has  opposite  orientation  to  Xoi  (r)  (Fig-  3). 
Quaternion  algebra  is  an  elegant  tool  for  handling  trans¬ 
formations  in  eUiptic  three-space  (SommerviUe,  19  4)  and 
consequently,  in  the  image  space  (Ravam  and  Roth,  1984)  it 
the  image  point  4oi|is  represented  by  a  quaternion,  then 
Xo!  (0  given  by  (4)  is  expressible  as  a  power  of  the  quatenuon 
ao,  Let  (1,  i.  k.  1)  denote  the  quaternion  basis,  then 
t,  =  So,sin4+cos«  where  So,  is  a  v^tor  qua- 

w^niom  Sih^^i  =  -  (4+^+4)  =  - 1.  de  Moivre’s  theorem 
shows  that 

Xoi  (0  =  soi  sin  {t4>)  +  cos  {t$)  =  qoi.  ^  (5) 

In  view  of  the  dual-velocity  formula  \ ^2{dX/dt)X^]  (Ge 
and  Ravani.  1994),  it  can  be  shown  that  the  dual  velocity  of 
the  screw  interpolant  (5)  is  given  by  V  =  0soi.  This  confirms 
that  ^i(0  corresponds  to  a  constant-speed  screw  motion. 

2*2  Starting  From  an  Arbitrary  Configuration.  We  nw 
study  the  screw  motion  from  configuration  Mq  to  Mi  for  the 
case  when  the  fixed  frame  F  is  not  coincident  with  Mo  (see  Fig. 
4).  Let  6o  and  bi  denote  the  image  points  of  the  configurations 
Mo  and  Mi  relative  to  F,  respectively.  Let  Qoi  denote  the  image 
point  of  the  displacement  Mo— Mi  measured  with  respect  to 
Mq.  The  fact  that  the  displacement  F^Mi  is  the  composition 

2  Here  the  term  “screw”  is  used  instead  of  “linear”  to  differentiate  the  linear 
motion  of  the  form  (26). 


X(<)  from  bo  to  b,  In  t 

of  the  displacements  F-Mj  and  Mo-M,  can  be  expressed  by 
the  quaternion  product  b,=bo4oi-  From  this  we  obtain 
^(„  =  bo '6,  where  bo  '  is  the  inverse  of  the  unit  <l“ate™ 
So!  Thus  the  constant-speed  screw  interpolant  from  Mo  to  M, 
described  with  respect  to  F  is  given  by 

X(r)=boqoi  =  bo(bo‘bi)',  /  =  [0,  1].  (6) 

Kinematically,  Eq.  (6)  means  that  the  “5?™«dmte 
ment  represented  by  X(/)  is  the  composition  of  the  two  dis 
placements  F-Mo  and  Mo-M(r), 

fnterpolant  (6)  is  invariant  with  respect  to  change  of  both  the 
moving  and  fixed-coordinate  frames.  ^  *  i  ♦  j  u,. 

When  the  two  configurations  bo  and  b,  are  not  related  by 
a  pure  translation,  the  formula  for 

by  the  right  hand  side  of  (6),  denoted  by  L(bo.  bi.  r),  can  be 
put  into  the  following  vector  form 

.  «  sin((l-f)^)c  .  sin (t4>)c.  ,■7^ 

»«♦  ”'  " 

.here  }.*+<*(♦  0)  is  Ihc  dual  (^anettatwOT  *e 

ha,  and  b.  see  Fig.  4.  Note  that  the  real  part  of  (7)  is  the 
spherical  linear  interpolation  used  mShoemake  (1985)  and  can 
be  obtained  directly  using  analytical  eUipuc  geometry  (see  Som- 

'"when  tlw  two  configurations  are  related  by  a  pure  trans¬ 
lation,  i.e.,  when  <#.= 0,  Eq.  (7)  becomes  a  hnear  interpolation. 

t(bo,b,;  0  =  (l-0bo-t-/b,  (8) 


since 


sin((l-0<^)_,  . 

hm - -i-‘i 

^_o  sin<^ 


sin  (t^)  , 
hm — 7— s— =t. 


(9) 


sin4> 

This  treatment  is  necessary  dmsion  by  a  pure  dual  number 

eh  is  not  defined  (Bottema  and  Roth,  1979).  Let  di,  and 

d(0  denote  the  vectors  in  the  fixed  frame  F  which 

the  origins  of  the  body-fixed  coordinate 

and  M(/),  respectively.  It  is  easy  to  show  that  Eq.  (8)  is  equiv 

alent  to  linear  interpolation  of  the  prions: 

d(0  =  (l-/)do-i-tdi. 

3  Nonalgebraic  B6zier  Motions 
The  screw  interpolant  in  the  preceding  section  can  be  applied 
repeatedly  for  constructing  curves  in  the  image  space  in  a 
manner  similar  to  the  deCasteljauwnstru^on  known  in  c^ 
puter  aided  geometric  design  (see  Farm,  1993).  ^bis  idea  stw 
with  Shoemake  (1985)  who  constructed  the  so-c^led 
Bezier  curves  in  the  space  of  unit  quaternions  for  animating 
rotations.  The  following  deCasteljau 

sidered  as  a  dual  form  of  Shoemake’s  spherical  deCasteljau 
algorithm. 

3.1  A  DeCasteljau  Algorithm  for  Bezier  Motions.  Let 
g,  ,_o,  1,  . . . , «  denote  n  + 1  image  points  of  spatial  displace- 
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Fig.  6  Four  Bezier  control  configurations  together  with  a  few  config¬ 
urations  belonging  to  the  Bezier  motion 


0  t 

Fig.  5  A  deCasteijau  algorithm  in  I 


ments.  They  form  a  control  polygon  in  the  image  space  L,  see 
Fig.  5  for  the  case  when  /i  =  3.  For  r=  I,  2,  •  •  /=0,  I, 

*  •  n  —  r  and  0</<l,  an  intermediate  point  on  each  screw 
interpolant  is  given  by  the  recursive  formula: 

6?(o=6r'(o[(6r‘(/))''b;:,'(/)]',  dO) 


where  6°  =  b„  0?  =  0,.  The  points  bp  ‘(t),  b/iT i‘  ( r ) ,  bj ( /)  denote, 
respectively,  jhe  starting  point,  end  point  and  intermediate 
point.  Let  denote  the  dual  distance 

between  6p’(/)  and  (0-  Equation  (10)  is  equivalent  to 
the  following  unifold  dual-spherical  linear  interpolation  when 


b;(t)= 


sin((l  -t)iy 


hr' 


{!)  + 


sin 


b;: ,'(/).  (11) 


sin<^,"‘  ‘  '  '  sir\<t>-' 

When  =  Eq.  (10)  reduces  to  the  linear  interpolation 

b;(0  =  (l-/)br'(O  +  rb;:,'(r).  (12) 


By  varying  the  parameter  /  in  the  range  [0,  1],  the  point 
b5(/)  traces  out  a  curve  in  E  with  control  points  b^  (/  =  0,  1, 
•  •  n).  In  Section  3.3,  we  will  see  that  although  bo(/)  is  in 

genreal  not  a  polynomial  curve,  it  satisfies  end-point  inter¬ 
polation  conditions  in  a  manner  similar  to  the  Bezier  curves 
in  We,  therefore,  call  this  curve  a  Bezier  image  curve  of 
rank  n.  The  corresponding  motion  is  termed  a  Bezier  motion 
of  rank  n.  Figure  6  shows  four  control  configurations  of  a 
robot  gripper  together  with  several  configurations  belonging 
to  the  Bezier  motion  of  rank  3.  Figure  7  shows  the  entire  Bezier 
motion  resulting  from  the  application  of  the  above  de- 
Casteljau’s  construction. 

We  note  that  if  all  control  points  are  collinear  in  £,  the 
Bezier  image  curve  degenerates  into  a  unifold  geodesic  line- 
segment  which  corresponds  to  a  screw  motion.  If  all  control 
points  are  coplanar,  the  Bezier  image  curve  becomes  a  unifold 
planar  curve  which  corresponds  to  a  line-symmetric  motion. 


3.2  The  Derivatives  of  Bezier  Image  Curves.  If  we  expand 
the  derivative  of  the  screw  interpolant  (11)  using  the  chain  rule, 
we  obtain 


bR0=<^r‘T;(0  + 


sin  ((1  -/)0r‘),-r- 

sin^r'  ' 


sin  (/^/“  )  -  .  ,  -r  1  f  -r 


'(/) 

cos((i 

sin  ir ' 


br'(/) 


cos  4>'  ■  ‘ 
sin^p' 


(13) 


where 


Fig.  7  The  entire  Bezier  motion  of  rank  3 


•  -  ^  COS  ‘)b/+/(/) -cos((I--/)0/  )bj  (0 
T[(t)  = - ^  — —  (14) 

is  a  polar  point  on  the  line-segment  b^O^  for  t[(/)  *6/ (/)  =  0 
and  T/(0*T/(0  =  L  The  image  space  can  be  interpreted,  at 
least  locally,  as  a  hypersphere  in  a  spacce  of  four  dual  di¬ 
mensions  (see  McCarthy  and  Ravani,  1986).  Then  if  b^(^/)  is 
interpreted  as  a  great  circular  arc  on  the  hypersphere,  t-(/) 
is  the  unit  tangent  vector  to  the  arc  at  t. 

The  derivative  of  a  Bezier  image  curve  may  be  obtained 
using  the  recursive  formula  (13).  The  explicit  formula  for  the 
derivative  is  in  general  rather  complicated  except  at  the  end 
points  /  =  0  and  /=1.  When  r  =  0,  we  have  0/'’(O)  =  ^/, 

^'r\0)  =  0y  T/(0)  =  T-(0)  and  therefore  Eq.  (13)  is  reduced  to 
6RO)  =  0,t‘(O)  +  6r‘(O).  (15) 

When  t  =  1,  we  have  0r‘(l)  =  '^/‘‘(l)  =  0,  t-(I) 

=  T,'+,_i(l)  and  Eq.  (13)  is  reduced  to 

b;(l)  =  ^,>,.,T!.,.,(l)+6?:,'{l).  (16) 

From  (15)  and  (16)  we  obtain  the  derivatives  of  a  Bezier  image 
curve  at  r  =  0  and  /.=  1: 


bS(  1 )  =  /70oti(O).  6S(  1 )  =  . ,  ( i ) 

where  To(0),  Ti.|(l)  are  unit  tangent  vectors  given  by 


Ti(0)  = 


bi  ~  bp  cos  4>o 


i(l)  = 


b„  cos  _  I  -  b„  - 


(17) 


(18) 


sin<i>o  .  sin(i)„., 

For  a  Bezier  image  curve  of  rank  3,  its  tangent  bivector  can 


also  be  given  in  terms  of  its  control  points  as 

boAbi, 


b^(0)AbJ(0)  =  3 


sin<i>o 


(19) 
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b^(l)AtJ(l)  =  3^62Ab3,  (2G) 

Sin  92 

where  ^oi  ^2  are  the  dual  angular  distances  between  bo,  hi  and 
62,  63,  respectively. 

The  second  derivative  of  the  screw  interpolant  (11)  is  ob¬ 
tained  by  differentiating  (13).  The  resulting  recursive  formula 
can  be  used  to  obtain  the  second  derivative  of  a  Bezier  image 
curve.  The  derivation  is  rather  lengthy  and  is  not  included  in 
this  paper.  At  /=0.  the  recursive  formula  for  computing  the 
second  derivative  is  given  by 

6K0)  =  -  M,b,+ ‘(O)tJ(O)  +  %D?(0)  +  6r  ‘(0).  (21) 


where 


DR0)  = 


b;:i‘(0)-br‘(0)cos^, 

sin  0/ 


Equation  (21)  can  be  used  to  determine  the  curvature  3-vector 
of  a  Bezier  image  curve  of  rank  3  at  if  =  0: 

6k0)A6^(0)A6^(0)=18  .  22  ■  i  boAb|Ab2.  (22) 
sin  <f>Q  sin  <p  I 

Similarly,  the  curvature  3-vector  at  f  =  1  is  obtained  as 


bJ(l)AbJ(l)AbJ(l)=l8 


sin^^2Sin^i 


biAb2Ab3.  (23) 


The  dual  curvature  k(t)  at  r  =  0  is  given  by 

20,  sin^o 


lbS(0)AbJ(0)Ab?(0)l 

k{0)  = - — - t: - ■ 

lb?(0)AbJ(0)l 


300  sin  00  sin  0, 


(24) 


where  j^o  is  the  dual  angular  distance  from  the  point  b2  to  the 
line  bob,. 

It  is  interesting  to  note,  in  the  limiting  case  when  0/— 0  (/ = 0, 
1),  the  tangent  and  curvature  properties  of  the  Bezier  image 
curves  of  rank  3  at  /  =  0  and  r  =  1  approach  to  those  of  Bezier 
cubics  in  Euclidean  three-space  at  f  =  0  and  f  =  1,  respectively. 
For  example,  at  /  =  0,  the  tangent  bivector  (19)  becomes 
3boAbi,  the  curvature  3-vector  (22^ becomes  18boAbiAb2,  and 
the  dual  curvature  becomes  2^q/3$o, 


3.3  More  Properties  of  Bezier  Motions.  Equations  (17) 
and  (18)  show  that,  similar  to  Bezier  curves  in  Euclidean  three- 
space,  the  image  curve  bo(0  is  tangent  to  the  first  leg  bob, 
and  the  last  leg  b^^ib;,  of  its  control  polygon.  In  addition,  the 
magnitude  of  the  tangent  is  n  times  that  of  the^  leg  itself. 
Furthermore,  it  can  be  shown  that  the  segment  bo  *^67”^  is 
tangent  to  the  curve  bo{0-  Kinematically,  this  means  that  the 
corresponding  Bezier  motion  of  rank  n  (denoted  by  bo(0 )  bas 
the  following  properties: 


(1)  End  configuration  interpolation.^ 

(2)  At  end  configurations  bo  and  £«,  the  instantaneous 
screw  axes  of  bo  ( 0  are  given  by  the  screw  axes  of  screw 
motions  bo(0  and  b^i>i(/),  respectively. 

(5)  The  initial  speed  of  bo(0  at  configuration  bo  is  n  times 
that  of  the  screw  motion  bo(0  and  the  final  speed  of 
bn(f)  at  b;,  is  n  times  that  of  the  screw  motion 

bL,(0. 

(4)  The  instantaneous  screw  axis  at  the  configuration 
bo(0  is  the  same  as  tjiat  of  the  screw  motion  from 
configuration  bS"^  to  b7"^ 

Bezier  motions,  obtained  in  this  fashion,  are  also  coordinate- 
frame  invariant  since  screw  motion  interpolants  are  coordinate- 
frame  invariant. 

Figure  8  shows  an  entire  Bezier  motion  of  rank  3  and  three 
constant-speed  screw  motions  defined  by  the  four  given  control 
configurations.  Clearly,  the  screw  motion  from  bo  to  6,  is 
* ‘tangent”  to  the  Bezier  motion  at  bo  and  the  screw  motion 
from  82  to  63  is  “tangent”  to  the  Bezier  motion  at  63. 

In  addition  to  the  aforementioned  end-point  interpolation 


Fig.  8  Bezier  motion  of  rank  3  and  the  three  constant  speed  screw 
motions 


properties,  in  the  special  case  when  all  configurations  share 
the  same  orientation,,  the  Bezier  image  curve  bo(0  reduces  to 
a  Bernstein-Bezier  polynomial  curve,  for  it  is  the  result  of 
repeated  linear  interpolations  of  form  (12).  The  corresponding 
motion  is  translational  motion  and  end  point  of  the  rigid  body 
undergoing  the  motion  traces  out  a  Bernstein-Bezier  polyno¬ 
mial  curve  in  the  Euclidean  three-space  £^. 

In  genreal,  however,  the  Bezier  image  curve  bo(r)  is  fun¬ 
damentally  different  from  a  Bernstein-Bezier  curve  in  First, 
while  a  Bernstein-B6zier  curve  in  is  a  polynomial  curve,  the 
image  curve  bo(t)  is  in  general  not  algebraic  but  transcenden¬ 
tal,  for  it  has  in  general  infinitely  many  intersections  with  a 
properly  oriented  twofold  plane  in  the  image  space.  Second, 
while  a  Bernstein-Bezier  curves  in  E^  has  subdivision  property, 
the  image  curve  bo(0  in  general  does  not  possess  this  useful 
property  of  subdivision  (see  Appendix  A  for  the  proof).  Con¬ 
sequently,  the  image  curve  traced  out  by  the  point  bo(0  when 
t  varies  from  0  to  1  is  in  general  different  from  the  image  curve 
resulting  from  subdividing  the  control  polygon  formed  by  6/ 
(/=0,  1.  •••,/i). 

It  is  noted  here  that  the  real  part  of  the  image  curve  bo(0 
is  the  same  as  the  “spherical  Bezier  curve”  presented  by  Shoe- 
make  (1985).  Pottmann  (1992)  has  also  pointed  out  the  lack 
of  subdivision  property  of  the  Bezier  curves  described  in  Shoe- 
make  (1985). 


4  Bernstein-Bezier  Polynomial  Motions 

This  section  develops  a  deCasteljau  type  algorithm  for  Bezier 
motions  which  have  Bernstein  polynomi^  basis  functions.  Such 
Bernstein-Bezier  type  motions  have  been  developed  analytically 
in  our  companion  paper  (Ge  and  Ravani,  1994)  and  have  the 
useful  subdivision  property. 

From  Ge  and  Ravani  (1994),  we  know  that  a  unifold  image 
curve  in  the  image  space  E  may  be  expressed  in  terms  of  non- 
normalized  coordinates  (denoted  by  W(0)  or,  equivalently, 
in  terms  of  unit-normalized  coordinates  (denoted  by  X(0)- 
They  are  related  by  W(/)  =  w(GX(0  where  w(0  is  a  dual¬ 
number  normalizing  function.  The  normalizing  function 
w(0  may  be  interpreted  geometrically  as  a  projection  operator . 
A  general  curve  W  (0  in  the  image  space  E  does  not  necessarily 
represent  a  rigid  body  motion.  It  represents  a  motion  (denoted 
by  X(0)  when  it  has  unit-no rmaUzed  coordinates.  We  shall 
refer  to  W(/)  as  the  preimage  of  X(/.).  This  geometric  inter¬ 
pretation  is  useful  in  explaining  the  subdivision  property  as¬ 
sociated  with  Bezier  polynomial  image  curves  to  be  developed 
in  this  section. 

The  Bezier  cubic  curves  in  the  image  space  E  are  given  by 
(Ge  and  Ravani,  1994) 
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wobo 

W(/)  =  H’(/)X(/)  =  [l  r ’T'c*  (25) 

M’2D2 

1^383 

where  b,-  (/=  1,  2,  3,  4)  denote  the  B6zier  control  points  in  E 
that  represent  the  key  configurations,  and  vv,-  (/=1,  2,  3,  4) 
denote  the  dual-number  weight  factors.  Eq.  925)  can  also  be 
expressed  in  terms  of  Bernstein  polynomials 

3 

W(r)  =  H’(t)X(f)  =  2’^<b>^(0 
1=0 

and  is  said  to  define  Bernstein-Bizier  cubic  motions. 

Similar  to  a  Bernstein-Bezier  curve  in  a  Euclidean  three- 
space  £’,  a  Bernstein-Bezier  unifold  image  curve  X(/)  given 
by  Eq.  (25)  can  be  alternatively  generated  by  repeated  linear 
interpolation  of  its  control  image  points.  Given  the  control 
points  bo  and  the  weights  Wj  (/=0, 1,  •  •  •,  n),  the  deC^teljau 
algorithm  for  constructing  a  Bernstein-Bezier  polynomial  mo¬ 
tion  (of  degree  n)  proceeds  as  follows; 

(7)  For  /=0,  1,  set  3^=ti-,b,; 

(2)  For  r=  1,2,  ••  •,nand/=0,  1, 

= (1  -  r)  Wr '  +  (26) 

(3)  Then  X(/)=Wo/lWol  where  IWol  denotes  the  mag- 
nitude  of  WJ. 

In  the  above,  each  intermediate  point  6/  is  given  by 

This  deCasteljau  algorithm  may  be  used  to  subdivide  a  Bern¬ 
stein-Bezier  image  curve.  The  algorithm  subdivides  the  prei¬ 
mage  W(r)  of  the  image  curveX(/).  The  intermediate  preimage 
points,  may  be  projected  onto  the  curved  image  space 
to  provide  us  with  the  control  polygons  for  the  “left”  and 
“right”  curve  segment. 

One  important  drawback  for  the  above  Bernstein-Bezier  ino- 
tion  is  that  it  is  difficult  to  adjust  the  speed  of  the  motion 
intuitively,  for  it  is  very  difficult  (if  not  impossible)  to  select 
the  constant  weights  w/  such  that  the  linear  motion  of  the  form 
(26)  has  constant  angular  velocity  or  pitch  (Ge  and  Ravani, 
1994). 


Conclusions 

This  paper  has  provided  a  theoretical  foundation  for  con¬ 
struction  of  deCasteljau  algorithms  for  motion  approximation. 
It  has  shown  that  Bezier  motions  obtained  by  the  application 
of  deCasteljau's  construction  are  not  of  Bernstein  form  and 
lack  the  subdivision  property  of  Bernstein-Bezier  cu^es  used 
in  CAGD.  Certain  differential  and  kinematic  properties  of  the 
Bdzier  motions  have  also  be  studied.  An  alternative  form  of 
deCasteljau  algorithm  was  presented  that  results  in  Bernstein- 
Bezier  motions  with  subdivision  property.  The  results  have 
applications  in  motion  animation,  kinematics  and  CAD/CAM. 


Acknowledgment 

The  first  author  gratefully  acknowledges  the  support  by 
National  Science  Foundation  (NSF)  Research  Intiation  Award 
MSS-9396265  to  the  State  University  of  New  York  at  Stony 
Brook.  The  second  author  gratefully  acknowledges  the  support 
by  NSF  grant  DMC-8796348  and  the  US  Army  Research  Office 
grant  DAAL03-90-0(X)5  to  the  University  of  California  at  Davis. 


References 

Barsky.  B.  A.,  and  DeRosc,  T.  D.,  1989,  "GcomctricConiinuity  of  Parametric 

754  /  Vol.  116,  SEPTEMBER  1994 


Curves:  Three  Equivalent  Characterizations,”  IEEE  Computer  Graphics  and 
Applications^  Vol.  9,  No.  6,  pp.  60-68. 

Boehm,  W.,  1987,  “Smooth  Curves  and  Surfaces,”  Geometric  Modeling: . 
Algorithms  and  New  Trends,  G.  Farin.  ed.,  SIAM,  Philadelphia,  PA,  pp.  175- 
184. 

Boiiema,  O.,  and  Roth,  B.,  1979,  Theoretical  Kinematics,  North  Holland 
publ.,  Amsterdam,  pp.  51-60,  150-152,  521-523. 

Duff.  T.,  1986,  “Quaternion  Splines  for  Animating  Orientation,”  Technical 
Report,  AT&T  Bell  Laboratories. 

Farin.  G.,  1993,  Curves  and  Surfaces  for  CAGD:  A  Practical  Guide,  3rd  ed., 
Academic  Press,  San  Diego,  473  pp. 

Flanders.  H.,  1963,  Differential  Forms  with  Application  to  the  Physical  Sci¬ 
ences,  Academic  Press,  New  York,  205  pp. 

Ge,  Q.  J.,  and  Ravani,  B.,  1994,  “Computer  Aided  Geometric  Design  of 
Motion  Inter polants,”  ASME  Journal  of  Mechanical  Design. 

McCarthy,  J.  M..  and  Ravani.  B..  1986,  “Differential  Kinematics  of  Spherical 
and  Spatial  Motions  using  Kinematic  Mapping,”  ASME  Journal  of  Applied 
Mechanics,  Vol.  53,  pp.  15-22. 

McCarthy,  J.  M.,  1990,  Introduction  to  Theoretical  Kinematia,  MIT  Press, 
Cambridge,  MA,  pp.  19-22,  60-63. 

Pleiinckx,  D.,  1989,  Quaternion  Calculus  as  a  Basic  Tool  in  Computer  Graph¬ 
ics,*'  The  Visual  Computer,  Vol.  5,  pp.  2-13. 

Potimann,  H.,  1992,  Personal  Communications. 

Ravani,  B.,  and  Roth.  B.,  1984,  “Mappings  of  Spatial  Kinematics,  ASME 
Journal  of  Mechanisms,  Transmissions,  and  Automation  in  Design,  Vol. 
106,  No.  3,  pp.  341-347. 

Shoemake,  K.,  1985,  “Animating  Rotation  with  Quatermon  Curves.  ACM 
Siggraph,  Vol.  19.  No.  3.  pp.  245-254. 

Sommcrville,  D.  M.  Y.,  1914,  Elements  of  Non-Euclidean  Geometry,  Dover 
Publ  (reprint  1958). 

Taylor,  R.  H.,  1979,  “Planning  and  Execution  of  Straight-Line  Mampulator 
Trajectories,”  IBM  J.  of  Research  and  Development,  Vol.  23,  No.  4,  pp.  253- 
264. 


APPENDIX  A 

This  appendix  proves  that  the  deCasteljau  algorithm  for 
constructing  the  nonalgebraic  Bezier  image  curves  does  not 
possess  the  subdivision  property.  For  convenience,  we  consider 
only  the  construction  of  a  nonalgebraic  Bezier  image  curve  of 
rank  2  using  three  control  points. 

Let  bo,  bi,  $2  be  the  Bezier  control  points  in  the  image  space 
E  and  let  s,  t  be  two  parametes  such  that  0<$<t<\.  Then 
the  deCasteljau  algorithm  yields  two  parametric  points  on  the 
Bezier  image  curve  (Fig.  9): 

b§  (5)  =  bj  (5)  I  ( bj  (^ ) )  -  '61  ( J)  r.  (27) 

b?(0=6o(0['(bi(/))”'bl(f)]'-  (28) 

Let  £  denote  the  intersection  point  of  the  two  lines  joining 
hois)  to  h\is)  and  bj(f)  to  bl(/).  If  the  subdivision  property 
exists  for  the  deCasteljau  algorithm,  then  the  point  bo(5)  can 
be  alternatively  generated  by  subdividing  the  triangle  formed 
by  the  three  points  bo,  bo(f),  and  bo(f)  in  the  ratio  is/t):l- 
This  leads  to  the  following 

c  =  bi(/)[(bi(0)"‘b?(0r^  (29) 

hl{s)  =  blis)l(His))-^tY^',  (30) 

Substitute  (28)  into  (29)  to  obtain 


Fig.  9  A  deCasteljau  algorithm  In  Z  does  not  possess  subdivision  prop¬ 
erty 

Transactions  of  the  ASME 


e=bi(/)((bi(0)''6|(or. 


(31) 


From  (27)  and  (30),  we  obtain 

£  =  bi(j)((bi(i))-'81(5)r.  (32) 

In  what  follows  we  show  that  in  general  (31)  and  (32)  cannot 
hold  simultaneously  and  therefore  conclude  that  the  de- 
Casteljau  algorithm  in  E  does  not  in  general  possess  subdivision 
property.  Let  denote  the  dual  angular  lengths  of  the 
segments  bo(5)bl(5)  and  bo(/)bl(/),  respectjvely^  Then  Eqs. 
(31),  (32)  mean  that  c  divides  the  segment  bo(/)bi(/)  in  the 
ratio  ^  sin  (5^',)/sin((l --S')^',)  and  divides  the  segment 
bo(5’)b}(s)  in  the  ratio  sin  (/^^)/sinj[(l  -*/)^^),  respectively.  If 
this  is^true  then  the  triangle  bo(f)bibl(0  and  its  transversal 
bo(^)bj  (5)  must  satisfy  the  Menalaus’  theorem  of  collinearity: 


sin((l->5)^o)  sin(^;)  sin  ( (r-5)»t)^ 

sin  ((/~5)^o)  sin((l -5)^^,)  sin  (500  ’  ^ 

where  0o»  are  the  angular  lengths  of  the  segments  bobj, 
6162,  respectively  (Details  on  Menelaus’  theorem  in  non-Eu- 
clidean  geometry  can  be  found  in  Sommerville,  1914).  In  the 
special  case  when  0o  =  </>i  =  O»  i-e.,  when  three  given  configu¬ 
rations  share  the  same  orientation,  Eq.  (33)  holds  and  it  reduces 
to: 

1-^  ^  t-s  ^ 

t-s  1-5  5 

by  virtue  of  Eq.  (9).  In  general,  however,  Eq.  (33)  does  not 
hold.  Thus  we  conclude  that  (31)  and  (32)  do  not  hold  si¬ 
multaneously  for  arbitrary  spaced  control  points  bo,  bi,  and 
62.  This  completes  the  proof. 
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In  this  article  we  generalize  the  concept  of  Bezier  curves  to  curved  spaces,  and 
illustrate  this  generalization  with  an  application  in  kinematics.  We  show  how  De 
Casteljau 's  algorithm  for  constructing  Bezier  curves  can  be  extended  in  a  natural 
way  to  Riemannian  manifolds.  We  then  consider  a  special  class  of  Riemannian 
manifold,  the  Lie  groups.  Because  of  their  group  structure  Lie  groups  admit  an 
elegant,  efficient  recursive  algorithm  for  constructing  Bezier  curves.  Spatial  dis¬ 
placements  of  a  rigid  body  also  form  a  Lie  group,  and  can  therefore  be  interpolated 
{in  the  Bezier  sense)  using  this  recursive  algorithm.  We  apply  this  alogorithm  to 
the  kinematic  problem  of  trajectory  generation  or  motion  interpolation  for  a  moving 
rigid  body.  The  orientation  trajectory  of  motions  generated  in  this  way  have  the 
important  property  of  being  invariant  with  respect  to  choices  of  inertial  and  body- 
fixed  reference  frames. 


1  Introduction 

One  of  the  cornerstones  of  geometric  design  has  been  the 
work  of  P.  Bezier  on  the  free-form  curve  design  method  that 
bears  his  name.  Bezier  curves  are  not  only  flexible  and  easy 
to  generate,  but  offer  a  simple  geometric  representation  of  a 
curve  in  terms  of  its  control  polygon,  as  well  as  a  firm  math¬ 
ematical  foundation  based  on  Bernstein  polynomials.  In  ap¬ 
plying  the  Bezier  method  to  the  kinematic  problem  of  trajectory 
generation  or  motion  interpolation  for  a  moving  rigid  body, 
however,  the  classical  geometric  design  techniques  need  to  be 
extended  to  curved  spaces.  In  principle  one  can  obtain  a  col¬ 
lection  of  local  coordinate  charts  for  a  given  curved  space,  and 
apply  existing  Euclidean  interpolation  techniques  to  these  co¬ 
ordinates.  The  resulting  curves,  however,  will  depend  on  the 
choice  of  local  coordinates,  which  clearly  leaves  something  to 
be  desired  from  both  a  mathematical  as  well  as  an  engineering 
perspective.  Another  requirement  motivated  by  the  moving 
rigid  body  problem  is  that,  to  the  extent  possible,  the  resulting 
motions  should  not  depend  on  the  choice  of  inertial  or  body- 
fixed  reference  frames;  in  the  language  of  Lie  groups  this  can 
be  phrased  as  the  question  of  whether  a  group  admits  a  bi¬ 
invariant  Riemannian  metric.  Using  standard  results  from  Lie 
theory  it  can  be  shown  that  bi-invariant  orientation  trajectories 
can  be  constructed,  but  that  in  general  there  is  no  bi-invariant 
metric  for  the  spatial  displacements  (see,  e.g.,  Park  et  al., 
1993). 

The  goal  of  this  article  is  to  generalize  the  concept  of  Bezier 
curves  to  curved  spaces.  The  existing  theory  of  Bezier  curves 
has  been  formulated  only  for  curves  in  Euclidean  space,  and 
only  recently  have  attempts  been  made  at  extending  Bezier’s 
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original  construction  to  particular  curved  spaces.  Shoemake 
(1985)  presents  a  class  of  methods  for  generating  curves  on 
rotations  that  are  based  on  unit  quaternion  representation. 
Although  unit  quaternions  have  certain  well-known  advantages 
over  other  representations  of  rotations  (e.g.,  Euler  angles), 
Shoemake’ s  approach  is  essentially  coordinate  dependent:  the 
resulting  motions  are  not  invariant  with  respect  to  choice  of 
inertial  and  body-fixed  frames,  and  his  methods  do  not  ade¬ 
quately  address  the  underlying  geometry  of  the  space  of  ro¬ 
tations  (e.g.,  the  2-1  nature  of  the  unit  quaternion 
representation).  In  more  recent  work,  Ge  and  Ravani  (1991, 
1994)  have  recognized  these  geometric  issues,  and  formulated 
rigid  body  motion  interpolation  in  terms  of  the  image  space 
manifold  of  kinematic  mapping  (see  e.g.,  Bottema  and  Roth, 
1979  or  Ravani  and  Roth,  1984).  In  particular,  the  2-1. am¬ 
biguity  is  resolved  by  defining  an  orientation  on  the  space  of 
unit  quaternions,  and  trajectories  are  constructed  in  terms  of 
the  first  fundamental  form  of  the  image  space  with  bi-invariant 
rotation  parts.  Juettler  (1994)  has  provided  a  theoretical  eval¬ 
uation  of  several  approaches  for  motion  interpolation  and  has 
discussed  coordinate  frame  dependency  of  some  of  these  ap¬ 
proaches. 

In  this  article,  we  formulate  a  general  framework  for  con¬ 
structing  Bezier  curves  on  Riemannian  manifolds,  and  then 
focus  specifically  on  a  special  class  of  Riemannian  manifold, 
the  compact  Lie  groups.  This  class  covers  a  wide  range  of 
curved  spaces  that  arise  frequently  in  kinematics:  for  example, 
the  group  of  proper  rotations  SO(n)  lies  at  the  heart  of  the 
rigid-body  motion  interpolation  problem,  and  the  special  un¬ 
itary  group  SU(2)  can  also  be  identified  with  unit  quaternions. 

This  article  is  organized  as  follows.  In  Section  2  we  show 
how  Bezier  curves  can  be  generalized  to  arbitrary  Riemannian 
manifolds,  and  discuss  some  of  the  computational  issues.  In 


Section  3  we  present  a  method  of  constructing  Bezier  curves 
on  compact  Lie  groups,  based  on  a  generalization  of  De  Cas- 
teljau’s  algorithm.  In  Section  4  we  apply  these  curve  generation 
techniques  to  the  design  of  smooth  trajectories  on  the  rigid- 
body  motions. 


2  Bezier  Curves  on  Riemannian  Manifolds 

2.1  A  Geometric  Interpretation  of  Classical  Bezier 
Curves,  Before  laying  down  the  mathematical  framework  for 
Riemannian  manifolds,  it  is  instructive  to  review  the  geometric 
interpretation  of  Bezier  curves  from  the  perspective  of  both 
Bezier’s  original  construction  and  of  De  Casteljau’s  algorithm 
(1963),  which,  incidentally,  precedes  Bezier’s  work.  Bezier 
(1963)  geometrically  defines  a  curve  from  four  ordered  ver¬ 
texes — the  polygon  formed  by  these  vertexes  is  called  the  con¬ 
trol  polygon — subject  to  a  set  of  geometric  constraints. 
Specifically,  let  the  ordered  vertices  be  {(0,  0,  0),  (1,  0,  0),  (1, 
0,  1),  (1,  1,  1)1,  and  let  C:  [0,  1]  — (R^  be  a  curve  such  that 

•  e(0)  =  (0,  0,  0)  and  6(1)  =  (1,  1,  1). 

•  The  tangent  at  6(0)  is  parallel  to  the  Jt-axis,  and  the 
tangent  at  6(1)  is  parallel  to  the  z-axis. 

•  The  osculating  plane  at  6(0)  is  parallel  to  the  x-y  plane, 
and  the  osculating  plane  at  6(1)  is  parallel  to  the  y-z 
plane. 

The  curve  satisfying  these  constraints  is  given  by  a  cubic  pol¬ 
ynomial.  Reversing  the  order  of  the  vertexes  results  in  the  same 
curve.  It  is  important  to  note  that  the  resulting  curve  is  de¬ 
termined  entirely  from  geometric  constraints,  and  that  the 
method  of  construction  can  be  extended  naturally  to  an  ar¬ 
bitrary  number  of  vertexes  arranged  randomly  in  Euclidean 
space.  The  Bezier  curve  can  also  be  viewed  as  an  approximation 
to  the  control  polygon  in  terms  of  Bernstein  polynomials  (as 
discovered  later  by  R.  Forrest,  1972). 

De  Casteljau  presents  another  method  of  constructing  a 
smooth  curve  given  a  control  polygon,  which  turns  out  to  be 
identical  to  the  Bezier  curve,  but  directly  exploits  the  rela¬ 
tionship  with  Berstein  polynomials.  This  algorithm  is  perhaps 
best  illustrated  by  Fig.  1.  Specifically,  let  the  ordered  set  of 
vertices  in  (R^  be  given  by  {po,  P\,  .  .  ,  ,  Pnlt  and  define  the 
polynomials 

pf(t)=^(l-t)PiSi^  +  tpf-^ 

where  pj-p,.  The  curve  given  by  /?J(0,  0  <  t  <  1  then 
corresponds  to  the  Bezier  curve.  Geometrically  De  Casteljau’s 
algorithm  constructs  the  curve  by  successive  linear  interpola¬ 
tion  between  the  vertexes  of  the  control  polygon. 

That  Bezier’s  original  construction  and  De  Casteljau’s  al¬ 
gorithm  are  equivalent  is  remarkable,  and  can  fundamentally 
be  traced  to  the  fact  that  the  curve  lies  in  Euclidean  space. 
The  polynomial  representation  of  these  curves,  which  makes 
them  computationally  attractive,  is  also  due  to  the  underlying 
space  being  Euclidean.  In  order  to  generalize  these  two  methods 
of  curve  construction  to  curved  spaces  one  must  first  generalize 
the  underlying  geometric  concepts.  In  the  De  Casteljau  method 
the  concept  of  linear  interpolation  between  two  points  in  a 
curved  space  needs  to  be  defined;  this  can  be  readily  done  on 
a  Riemannian  manifold,  where  the  minimal  geodesic  plays  the 
role  of  the  straight  line  for  curved  spaces,  and  lengths  can  be 
measured  in  terms  of  the  Riemannian  metric.  Bezier’s  con¬ 
struction,  however,  does  not  seem  to  generalize  in  a  natural 
way  to  the  Riemannian  setting.  Although  tangency  between 
curves  is  well-defined,  the  notion  of  an  osculating  plane  relies 
inherently  on  the  manifold  being  embedded  in  some  larger 
ambient  Euclidean  space,  and  in  general  there  is  no  natural 
way  to  do  this.  It  is  also  more  desirable  to  define  a  Bezier 
curve  in  terms  of  the  intrinsic  geometry  of  the  manifold,  rather 
than  the  underlying  space  in  which  it  lies.  For  Riemannian 
manifolds,  therefore,  the  natural  way  to  define  Bezier  curves 


Fig.  1  The  De  Casteljau  algorithm  for  W  =  4 


is  by  generalizing  De  Casteljau’s  algorithm.  Naturally  for  cer¬ 
tain  manifolds  the  minimal  geodesic  between  two  points  may 
not  always  be  unique,  so  that  a  number  of  subtleties  (addressed 
below)  will  arise. 

Before  proceeding  some  clarification  on  what  we  mean  by 
“geometric”  may  help  justify  our  emphasis  on  the  so-called 
“geometric”  construction  of  Bezier  curves.  In  principle  any 
differentiable  manifold  of  dimension  n  can  be  locally  repre¬ 
sented  by  a  set  of  coordinates  (xi, .  .  .  ,  jc„).  One  might  there¬ 
fore  be  tempted  to  simply  construct  Bezier  curves  in  terms  of 
these  local  coordinates,  and  regard  the  corresponding  curve 
on  the  manifold  as  the  generalized  Bezier  curve.  The  flaw  with 
this  construction,  of  course,  is  that  the  resulting  curves  depend 
on  the  choice  of  local  coordinates.  Any  geometric  scheme  for 
generalizing  Bezier  curves  must  by  definition  be  coordinate- 
invariant.  For  Lie  groups  we  shall  impose  an  additional  re¬ 
quirement  of  bi-invariance. 


2.2  Bezier  Curves  on  Riemannian  Manifolds.  Clearly  the 
key  to  generalizing  De  Casteljau’s  algorithm  to  curved  spaces 
is  the  notion  of  linear  interpolation,  or  even  more  fundamen¬ 
tally,  lines.  On  a  Riemannian  manifold  the  geodesics  (with 
respect  to  the  given  Riemannian  metric)  play  the  role  of  lines. 
We  now  formally  review  these  concepts. 

Let  A/ be  a  Riemannian  manifold  of  dimension  n,  with  local 
coordinates  (xi,  X2,  ...»  x„),  and  Riemannian  metric 
ds^=gij{x)dXidXj  (we  adopt  the  physicists’  convention  of  sum¬ 
mation  over  repeated  indices).  If  a  curve  6  on  A/  is  given  in 
local  coordinates  by;c(/),0<^<l,  then  the  length  of  the 
curve  is  given  by  the  integral 


Just  as  a  line  in  Euclidean  space  can  be  considered  as  the 
shortest  path  between  two  points,  on  a  Riemannian  manifold 
the  minimum  length  curve  joining  two  points  can  be  regarded 
as  the  analog  of  the  straight  line.  A  parametrized  curve  6  is 
a  geodesic  if  it  is  a  critical  point  of  the  energy  functional 


E 


dXj  dxj 

lilt 


dt 


and  is  a  minimal  geodesic  if  it  minimizes  E,  Observe  that  the 
integrands  of  E  and  L  differ  by  a  square:  the  length  L  is 
invariant  with  respect  to  reparametrizations  of  6,  whereas  E 
clearly  depends  on  the  parametrization.  Interestingly,  the  curves 
minimizing  E  also  minimize  L,  and  moreover  are  parametrized 
with  respect  to  arc-length.  In  local  coordinates  the  geodesics 
satisfy  the  system  of  differential  equations 


(1) 


for  1  <  /  <  where 


p//  ( dg/y  dgji\ 

\dxj  dx,  dxj 


i 


and  (g“)  =  The  rj*.  are  known  as  the  Christoff  el  sym¬ 

bols  of  the  second  kind. 

Local  existence  and  uniqueness  of  solutions  for  the  Euler- 
Lagrange  equations  of  geodesics  can  be  shown  for  a  given  set 
of  initial  conditions  (see,  e.g.,  Gallot  et  al.,  1990).  This  result 
implies  that  any  two  points  which  are  “close  enough”  to  each 
other  are  joined  by  a  unique  minima!  length  geodesic.  This 
property  is  generally  not  global;  for  example,  the  geodesics  on 
the  two-sphere  are  given  by  the  great  circles,  and  any  two 
antipodal  points  will  have  any  number  of  minimal  geodesics. 
If  any  geodesic  on  Af  can  be  extended  to  a  geodesic  defined 
on  all  of  (R,  then  the  manifold  is  said  to  be  geodesically  com¬ 
plete.  A  theorem  of  Hopf-Rinow  asserts  that  if  a  manifold  is 
geodesically  complete,  then  any  two  points  of  A/ can  be  joined 
by  a  minimal  geodesic;  note  that  the  theorem  does  not  claim 
that  this  geodesic  is  unique.  For  the  purposes  of  this  paper  we 
shall  only  consider  geodesically  complete  manifolds.  The  in¬ 
terested  reader  is  referred  to  Gallot  et  al,  ( 1 990)  for  the  technical 
requirements  of  such  manifolds. 

Having  established  the  minimal  geodesics  as  the  Riemannian 
analog  of  straight  lines,  Bezier  curves  can  now  be  easily  gen¬ 
eralized  via  De  Casteljau’s  algorithm.  Let  the  n  ordered  points 
of  M  forming  the  control  polygon  be  labelled  [po,  pu  , 
Pn]-  Represent  the  minimal  geodesic  between  any  two  points 
p,  q  (l  Mhy  the  curve  exp(p,  r),  0  <  r  <  1,  where  exp(/7, 
Q,0)  -  p  and  exp(p,  q,  1)  =  q.  Define  the  sequence  of  curves 
on  M 

A*(/)=exp(pfr,'  U),pf-'  it),  t) 

Here  p%t)  =  pj.  The  Bezier  curve  is  then  given  by 

p^(r)=exp(p;:|(/),pr‘(0,  0 

which  is  analogous  to  the  De  Casteljau  algorithm  for  Euclidean 
space. 

It  is  clear  that  constructing  Bezier  curves  on  Riemannian 
manifolds  by  this  algorithm  is  computationally  more  involved 
than  for  the  Euclidean  case:  computing  the  geodesic  between 
any  two  points  involves  the  solution  of  the  nonlinear  differ¬ 
ential  equation  ([!]),  a  two-point  boundary  value  problem  (and 
therefore  more  difficult  than  integrating  a  differential  equation 
with  only  initial  conditions).  Even  if  we  assume  that  the  geo¬ 
desics  forming  the  control  polygon  have  been  precomputed 
and  stored  in  a  table,  for  each  instant  t  the  geodesic  equations 
still  need  to  be  solved  (n  -  1)(;2  -2)/2  times.  Clearly  this 
presents  difficulties  for  interactive  design  applications. 

One  method  of  obtaining  approximate  Bezier  curves  in  real¬ 
time  is  to  compute  only  a  discrete  set  of  points  on  the  curve, 
and  to  interpolate  between  these  points  using  local  coordinates. 
Specifically,  the  Bezier  curve  p"„(t)  can  be  sampled  /V*  times  at 
uniformly-spaced  intervals  of  t  using  the  algorithm  given  above. 
Labelling  these  points  P(/),  /  =  1,  .  .  .  ,  /y,  one  can  then 
obtain  smooth  interpolants  between  adjacent  points  P{i  —  1) 
and  P{i)  in  terms  of  local  coordinates-  While  any  convenient 
interpolating  spline  can  be  chosen,  care  must  be  taken  to  ensure 
continuity  of  the  proper  order  at  the  knot  points.  The  obvious 
drawback  of  this  approach,  of  course,  is  that  the  resulting 
curve  depends  on  the  choice  of  local  coordinates.  Nevertheless, 
it  is  a  practical  means  of  designing  curves  on  Riemannian 
manifolds  in  real-time  that  are  nearly  coordinate-invariant. 

3  Bezier  Curves  on  Lie  Groups 

3.1  Lie  Groups.  We  now  specialize  to  a  special  class  of 
Riemannian  manifold,  the  matrix  Lie  groups.  A  Lie  group ^  G 
is  a  differentiable  manifold  and  an  algebraic  group  whose 
operation  (x, y)— is  smooth.  Some  well-known  examples 
of  Lie  groups  include  Gl(n),  the  general  linear  group  of  nxn 


For  a  comprehensive  account  of  applications  of  Lie  groups  in  kinematics  the 
reader  is  referred  to  Karger  and  Novak  (1985). 


real  nonsingular  matrices,  and  SI(n),  the  special  linear  group 
of  nxn  real  nonsingular  matrices  of  unit  determinant. 

Letp  be  a  point  on  a  matrix  Lie  group  G,  and  Ar(  f )  a  smooth 
curve  on  G  defined  over  sqme  open  interval  of  0  such  that 
^(0)  =  p.  The  derivative  ^(0)  is  said  to  be  a  tangent  vector 
to  G  at  p\  the  set  of  all  tangent  vectors  at  p,  denoted  T^G, 
forms  a  vector  space,  called  the  tangent  space  to  G  at  p.  The 
tangent  space  at  the  identity  p  =  /  is  given  a  special  name, 
called  the  Lie  algebra  of  G,  and  denoted  by  a  lower-case  g. 
On  a  matrix  Lie  group  the  Lie  algebra  is  also  given  by  matrices. 
For  example,  the  Lie  algebra  of  SO(3),  denoted  so(3),  is  the 
set  of  3  X  3  real  skew-symmetric  matrices  (see,  e.g.,  Belinfante 
and  Kolman,  1972). 

More  generally  a  Lie  algebra  is  a  vector  space,  V,  together 
with  a  bilinear  map  VxV-V  (called  the  Lie  bracket) 
that  satisfies,  for  any  AT,  T,  Z  €  V,  (/)  [X,  =  0,  and  (//) 

[Xy  [y,  2]]-i-[Z,  [Xy  y]]  +  [y,  (Z,  =  O.  For  matrix., Lie 

algebras  the  Lie  bracket  is  given  by  the  matrix  commutator; 
if  A",  y  €  g  are  square  matrices,  then  [Xy  Y]  =  XY  -  YX. 

Defined  on  each  Lie  algebra  is  the  exponential  mapping  into 
the  corresponding  Lie  group.  On  matrix  groups  the  exponential 
mapping  corresponds  to  the  usual  matrix  exponential,  i.e.,  if 
A^gy  then  txp  A  ^  I  ^  A  ^  A^/2l  +  ...  is  an  element  of 
G.  Observe  that  f  €  (R,  itself  forms  a  group,  in  this  case 
a  subgroup  of  the  Lie  group.  Such  groups  are  called  one- 
parameter  subgroups  of  a  Lie  group,  and  play  a  special  role 
in  the  description  of  minimal-length  paths  on  Lie  groups  as 
we  show  below.  Before  lengths  of  paths  can  be  defined  we 
first  need  to  examine  Riemannian  metrics  on  Lie  groups. 

If  -y(0  is  a  differentiable  curve  on  G  as  before,  then  X  € 
7^G.  One  can  exploit  the  group  structure  of  G  to  represent  X 
as  an  element  of  the  Lie  algebra  as  follows.  Let  g,  /i  €  G,  and 
define  the  left  and  right  translation  maps  Lk-\:  G—Q, 

G— G  byL;,-i(g)  =  /2“*g  and  i?/,- 1(g)  =  g/i “ ^  respectively. 
It  follows  that  their  derivatives  dLh- 1  and  dRh-i  are  mappings 
from  G  to  Tfi  =  g.  By  applying  these  two  maps  to  X 
it  can  be  seen  that  X^^  X  and  XX^^  are  elements  of  g.  Any 
tangent  vector  can  therefore  be  identified  with  an  element  of 
g  by  either  left  or  right  translation.  Since  g  is  a  vector  space, 
any  inner  product  on  g  will  define  a  Riemannian  metric  for 
G. 

Let  Q  be  a  quadratic  form  on  g,  and  X{t)  a  curve  on  G. 
Then  X^TxGy  and  X^  ^X  A  and  XX'  ^  A  Vr  are  elements 
of  g.  Observe  that  Vr  and  are  related  by  Vr  -  XV^X^^ 
4  Ad;^^).  Q  defines  an  inner  product  on  g,  which  in  turn 
defines  two  classes  of  Riemannian  metric  on  G: 

k  \  vlQyi 

<  • ,  ‘  >z  and  <  * ,  ■  >j?  are  the  left-  and  right-invariant  Riemannian 
metrics  on  G  defined  by  Q.  If  <•,•>£  =  <•,•>;?  the  metric  is 
said  to  be  bi-invariant.  Any  Lie  group  admits  a  left-  or  right- 
invariant  metric  from  the  construction  above,  but  not  all  Lie 
groups  admit  a  bi-invariant  metric.  One  well-known  condition 
in  which  a  bi-invariant  metric  is  always  guaranteed  to  exist  is 
if  the  Lie  group  is  compact.  In  this  case  the  geodesics  of  G 
(with  respect  to  the  bi-invariant  metric)  are  the  one-parameter 
subgroups  of  G  and  its  translates,  i.e.,  if  ^  €  g  and  r  €  CR, 
then  is  a  geodesic  as  is  and  H  for  any  H  i  G. 

On  compact  G  it  is  known  that  any  g  €  G  lies  on  a  one- 
parameter  subgroup.  Hence,  given  any  two  points  in  G  there 
always  exists  a  geodesic  (with  respect  to  the  bi-invariant  metric) 
connecting  them.  To  find  the  minimal  geodesic  we  must  con¬ 
sider  the  inverse  of  the  exponential  map  exp:  g— G.  If  G  is 
compact  then  it  is  well-known  that  exp  is  onto,  but  typically 


its  inverse  map  will  be  multiple-valued.  We  therefore  define 
log:  G—g  by 

\og{A)=a 

such  that  a^Qa  is  minimal  among  all  possible  a  ^  g  satisfying 
exp(j)  =  A,  The  minimal  geodesic  between  A  and  B  is  then 
given  by  X(t)  =  A  exp(n/),  0  <  /  <  1,  where  fi  =  log(/4'’^ 
B),  Moreover,  if  A  and  B  are  left-translated  by  some  constant 
C to  CA  and  CB,  then  the  minimal  geodesic  is  given  by  CX{t) . 
Similarly,  the  minimal  geodesic  between  AB and  ACisX(t)C, 
These  properties  follow  from  the  bi-invariance  of  the  Rie- 
mannian  metric,  and  can  be  verified  by  an  elementary  calcu¬ 
lation. 


3.2  The  De  Casteljau  Algorithm  on  Compact  Lie 
Groups.  Having  established  that  the  geodesics  on  compact 
Lie  groups  (with  respect  to  the  bi-invariant  Riemannian  metric) 
are  the  one-parameter  subgroups  and  its  translates,  we  now 
describe  the  algorithm  for  generating  Bezier  curves  on  such 
spaces.  Let  G  be  the  Lie  group  with  Lie  algebra  g,  exp:  g— G 
the  exponential  map,  and  log:  G— g  the  inverse  map  that  pro¬ 
vides  the  minimal  norm  value  as  described  above.  Let  Ipo,  pi, 
,  .  ,  ,  p„]  be  the  ordered  set  of  points  in  G  that  form  the 
control  polygon.  The  Bezier  curve  is  now  defined  recursively 
as  before: 

PiU)  =p?r,‘ ( / )exp(/  )(t)]-^pf-\t)),pUl)=Pi 

and  the  Bezier  curve  is  given  by 

/?:(/)  =p::j(Oexp(/  iogt(p;:i)(/)]-Vr‘(0) 

We  now  restrict  our  attention  to  the  rotation  group  SO(3)  in 
the  next  section. 


4  Kinematics  Application:  Bezier  Curves  on  SO(3) 

The  rotation  group  SO(3),  consisting  of  the  3  x  3  real  or¬ 
thogonal  matrices  with  unit  determinant,  forms  a  Lie  group, 
with  its  Lie  algebra  so(3)  given  by  the  vector  space  of  3 x3 
real  skew-symmetric  matrices  of  the  form 


0 

-W3 

Cd2 

Cd3 

0 

-Wl 

-CO2 

0 

The  following  explicit  formulas  for  the  exponential  and  log¬ 
arithm  mappings  on  SO(3)  and  so(3)  are  well-known: 

Lemma  1  Given  ^  €  50(3), 


r  T  sinDojS  ,  l-cosHoji  , 
expM=/+^p-;— H+— ;-;;3 - [o)]’ 


Icol 


where  [u]  is  the  skew-symmetric  matrix  representation,  and 
Icoll  the  standard  Euclidean  norm. 

Lemma  2  Given  ©  6  S0(3)  such  that  TrC©)  - 1.  Then 


where  <i>  satisfies  I  -i-  2cos  =  Tr(0),  1<^(  <  tt.  Furthermore, 
filog  of  = 

Remark  1  When  Tr(0)  =  —  1,  log  0  can  have  two  possible 
values  on  the  closed  ball  of  radius  x;  if  w  is  a  unit  length 
eigenvector  of  0  associated  with  the  eigenvalue  1 ,  then  a  simple 
calculation  shows  that  log  0=  ±x[a>]. 

Remark  2  Lemmas  1  and  2  suggest  the  standard  visualization 
of  S0(3)  as  a  solid  ball  of  radius  x,  centered  at  the  origin  with 
the  antipodal  points  identified;  a  point  w  in  the  ball  represents 
a  rotation  by  an  angle  !Ia)D  about  the  line  passing  from  the 
origin  through  w.  The  rotations  whose  traces  equal  -  1  have 
a  rotation  angle  of  x,  and  correspond  to  points  on  the  boundary 
of  the  solid  ball. 

Remark  3  It  is  clear  from  above  that  the  logarithm  on  S0(3) 


is  multiple-valued:  given  0  6  S0(3)  such  that  H  =  log  0, 
HojII  <x,  then  the  set  of  all  possible  values  of  log  0  is 

[cij]-K2x/t[a)],  /26Z 

where  oj  =  co/Kojil.  This  is  akin  to  the  situation  in  the  complex 
plane,  where  if  is  a  point  on  the  unit  circle  for  some  0  < 
<t>  <  2x,  then  corresponds  to  the  same  point  for  any 

integer  n. 

S0(3)  is  a  compact  Lie  group  and  as  such  admits  a  bi¬ 
invariant  Riemannian  metric  <•,•>.  To  see  how  the  metric  is 
applied,  let  0(0. be  a  curve  on  SO(3),  and  0^*0  =  [cd*],  00”‘ 
=  [cdj.  Then  <0,  0>  =  =  cojJ  coj,  where  c  is  a  positive 

constant  scale  factor;  the  Riemannian  metric  is  therefore  de¬ 
termined  by  the  quadratic  form  Q  -  cl  on  so(3).  For  con¬ 
venience  we  shall  henceforth  set  c  =  1. 

In  terms  of  the  bi-invariant  metric  the  geodesics  on  SO(3) 
are  given  by  the  translates  of  the  one-parameter  subgroups 
A  6  so(3)  and  /  ^  (R.  The  minimal  geodesic  between  two  ele¬ 
ments  01,  02  €  SO(3)  is  given  by  the  curve 

0(O  =  0ieX  0<r<l 
where  ^12  =  log(0r*  02),  or,  equivalently, 

0(/)  =  ^°2»'0i,  0<^<1 

where  Qii  =  log(020r^). 

With  this  simple  characterization  of  the  minimal  geodesics, 
Bezier  curves  can  now  be  constructed  in  a  straightforward 
manner  on  SO(3).  For  example,  with  3  control  points  0o,  0i, 
and  02,  the  Bezier  curve  is  given  by 

0(/)  = 

forO  <  /  <  1,  where Qoi  =  log(0i“^0i) and Q12  =  log(©r*02). 

The  general  case  with  n  control  points  is  as  follows.  Let  0°, 
0?,  .  .  .  ,  0°  be  the  ordered  set  of  control  points,  and  define 

ehn = ef-' 

where  k  ranges  from  1  to  /i,  and  i  from  Oion  -  k.  The  Bezier 
curve  is  then  given  by  0o(O- 

Remark  4  The  above  construction  can  also  be  applied  to 
design  Bezier  curves  in  SE(3),  where  the  one-parameter 
subgroups  are  now  the  screw  motions.  It  is  well-known, 


Fig.  3  A  continuous  motion  interpolating  the  control  configurations 


however,  that  SE(3)  does  not  admit  a  bi-invariant  Riemannian 
metric  (see,  e.g.,  Duffy,  1990),  and  that  the  one-parameter 
subgroups  on  SE(3)  are  no  longer  geodesics  with  respect  to 
any  left-  or  right-invariant  Riemannian  metric.  Rather,  in  this 
case  the  geodesics  on  SE(3)  are  simply  the  projections  of  the 
geodesics  on  CR^  x  SO{3)  (Park,  Murray,  and  McCarthy,  1993). 
Hence,  given  a  particular  left-  or  right-invariant  Riemannian 
metric  on  SE(3),  the  corresponding  Bezier  curve  can  be  con¬ 
structed  by  combining  the  appropriate  Bezier  curves  in  (R^  and 
SO(3).  From  a  physical  viewpoint  this  is  more  appealing,  since 
there  is  nothing  natural  about  the  screw  motions  from  the 
point  of  view  of  dynamics.  In  fact,  in  the  absence  of  external 
forces  one  would  expect  the  motion  of  the  center  of  mass  of 
a  rigid  body  to  be  linear,  while  the  orientation  is  governed  by 
Euler’s  equations, 

4.1  Example.  The  Cartesian  space  trajectory  of  a  robot 
is  to  be  designed  such  that  it  interpolates  between  the  two  end 
configurations  of  the  end-effector  shown  in  Fig.  2.  Four  con¬ 
trol  configurations  are  used  (as  shown  in  Fig.  2)  and  the  tra¬ 
jectory  of  the  end-effector  is  generated  using  the  Bezier  curves 
in  (R^  and  SO(3)  as  discussed  above.  The  resulting  motion  or 
trajectory  of  the  end-effector  is  shown  in  Fig.  3. 


5  Conclusion 

By  generalizing  the  notion  of  straight  lines  to  curved  spaces, 
Bezier  curves  can  be  defined  on  Riemannian  manifolds  by  a 
suitable  generalization  of  De  Casteljau’s  algorithm.  In  the  case 
when  the  Riemannian  manifold  is  a  compact  Lie  group  ap¬ 
pealing  formulas  exist  for  the  minimal  geodesics,  which  are 
given  by  matrix  exponentials.  The  algorithm  has  been  illus¬ 
trated  for  the  particular  case  of  SO(3),  with  explicit  formulas 
given  for  the  matrix  exponential  and  logarithm.  The  resulting 
orientation  trajectories  are  invariant  with  respect  to  the  choice 
of  inertial  or  body-fixed  reference  frames  for  the  rigid  body. 
These  results  have  direct  applications  to  kinematics  and  ani¬ 
mation  of  rigid  body  motions,  as  well  as  to  any  problem  in 
which  the  physical  aspects  are  described  by  a  Lie  group. 
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Abstract 

This  paper  presents  a  variational  method  for  smooth  interpolation  of  the  rotation 
group  S0(3).  By  vievhng  SO(3)  as  a  Lie  group  equipped  with  a  natural  Riemannian 
metric,  we  apply  the  coordinate-invariant  methods  of  Riemannian  geometry  to  construct 
spline  curves  that  are  invariant  vnth  respect  to  both  left  and  right  translations  and  gen¬ 
erate  curves  that  approximately  minimize  a  measure  of  smoothness.  Such  spline  curves, 
in  the  context  of  moving  rigid  bodies,  generate  orientation  trajectories  that  are  indepen¬ 
dent  of  the  choice  of  inertial  or  body-fixed  reference  frames  and  are  smooth.  Based  on 
this  construction,  a  computationally  efficient  algorithm  for  interpolating  multiple  points 
in  SO(3)  is  presented. 

1  Introduction 

One  of  the  frequently  encoimtered  problems  in  applications  ranging  from  computer  graphics 
and  animation  to  robot  trajectory  planning  is  the  interpolation  of  smooth  curves  in  SO  (3), 
the  space  of  rotations.  In  this  paper  we  address  the  following  problem:  given  an  ordered 
set  of  n  rotation  matrices  {Ri,  R2, ...,  iin}  (the  knot  points),  and  a  set  of  n  scalars  ti  < 


t2  <  •  ••  <  tn  (the  knot  times),  find  a  curve  R  :  [ti,tn]  SO (3)  such  that  R(ti)  =  Ri, 
i  =  1,2, . . .  ,n.  Our  goal  is  to  find  a  computationally  efficient  method  of  spline  interpolation 
in  SO(3)  that  produces  reasonably  smooth  curves. 

Well-established  techniques  exist  for  spline  interpolation  in  vector  spaces,  but  for  the 
most  part  these  techniques  have  yet  to  be  generalized  in  a  uniform,  coordinate-free  way  to 
curved  spaces  like  SO  (3).  Cubic  splines,  for  example,  are  useful  because  of  their  simplicity 
and  computational  efficiency,  and  are  further  justified  by  the  physical  argument  that  they 
minimize  the  smoothness  criterion 

j  (1) 

To  generalize  this  approach  to  curved  spaces,  one  might  begin  by  formulating  the  corre¬ 
sponding  second-order  smoothness  measure;  more  precisely,  the  curved  space  operator  that 
plays  the  role  of  the  second  derivative  needs  to  be  defined.  In  principle  one  could  choose 
a  set  pf  local  coordinates  for  the  space  and,  proceeding  exactly  as  in  Euclidean  space,  con- 
struct  cubic  splines  in  terms  of  these  coordinates.  However,  curves  generated  in  this  fashion 
axe  known  to  depend  on  the  choice  of  coordinates,  so  that  this  method  is  geometrically  ill- 
defined.  An  even  more  subtle  issue  that  arises  in  the  SO(3)  case  is  the  question  of  translation 
invariance:  given  two  ordered  sets  {i?i, . . . ,  Rn}  and  {.Ri, . . . ,  Rn}  in  SO(3),  where  Ri  and 
Ri  are  related  by  .Ri  =  QRiS  for  Q,  S  constant  SO(3)  matrices,  it  is  reasonable  to  demand 
that  the  interpolating  curves  through  these  two  sets,  denoted  R{t)  and  R{t),  respectively, 
be  related  by  R{t)  =  QR{t)S.  Physically  this  reflects  the  fact  that  the  choice  of  inertial  and 
body-jfixed  reference  frames  for  a  rigid  body  should  not  influence  the  orientation  trajectory 
of  the  interpolated  motion. 

In  this  paper  we  present  a  variational  method  for  smooth  interpolation  in  SO(3)  that 
is  both  left  and  right  translation  invariant  (or  bi-invariant),  and  generates  curves  that 
approximately  minimize  a  certain  geometrically-defined  measure  of  smoothness.  In  our 
approach  SO  (3)  is  regarded  as  a  Lie  group  with  a  natural  Riemannian  metric,  and  the 
role  of  the  second-order  derivative  operator  is  played  by  the  symmetric  covaxiant  derivative 
compatible  with  this  metric.  On  Riemannian  manifolds  this  covariant  derivative  is  the 
natural  coordinate-invariant  generalization  of  second-order  derivatives  in  Euclidean  space, 
and  in  SO  (3)  the  corresponding  second-order  smoothness  functional  turns  out  to  be  simply 
the  integral  of  the  squaxed-norm  of  the  angular  acceleration.  We  review  the  specific  form 
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of  the  functional  and  its  associated  Euler-Lagrange  equations  for  SO  (3).  In  particular,  our 
analysis  relies  on  using  the  canonical  coordinates  of  the  first  kind  as  local  coordinates  on 
SO  (3).  Because  solving  the  Euler-Lagrange  equations  is  impractical  for  interactive  CAD 
applications,  a  method  based  on  the  canonical  coordinates  is  presented  that  preserves  bi¬ 
invariance,  yet  produces  simple  solutions  that  approximately  minimize  the  second-order 
smoothness  functional.  These  curves,  as  will  be  seen,  properly  reduce  to  the  minimal 
geodesics  under  appropriate  boundary  conditions. 

The  paper  is  organized  as  follows.  In  Section  2  we  review  the  geometry  of  SO(3)  as  a  Lie 
group,  and  derive  the  corresponding  canonical  coordinates  of  the  first  kind.  In  Section  3  we 
express  the  second-order  smoothness  functional  in  terms  of  the  canonical  coordinates,  and 
show  that  under  some  mild  assumptions,  the  optimal  curves  are  given  by  the  exponential 
of  a  cubic  matrix  polynomial.  To  illustrate  the  efficiency  of  this  approach  the  case  of  two 
point  interpolation  is  considered  in  some  detail.  In  Section  4  the  two  point  interpolation 
results  are  extended  to  an  algorithm  for  multiple  point  interpolation.  We  conclude  with 
some  remarks  on  how  the  interpolation  of  rigid  body  motions  might  be  addressed  within 
the  given  geometric  framework. 

Before  proceeding  we  mention  some  of  the  relevant  previous  work  in  SO(3)  interpolation. 
One  of  the  more  widely  cited  approaches  is  the  work  of  Shoemake  (1985),  who  presents  a 
class  of  interpolation  schemes  based  on  the  unit  quaternion  representation  for  rotations. 
While  the  unit  quaternions  are  known  to  provide  a  globally  nonsingular  four-parameter 
representation  for  rotations,  Shoemake’s  algorithm  essentially  applies  existing  Euclidean 
interpolation  techniques  to  this  particular  set  of  coordinates,  so  that  the  resulting  curves 
will  not  in  general  be  bi-invariant  and  the  interpolated  motions  are  not  necessarily  Eu¬ 
clidean.  Also,  some  of  the  expected  characteristics  are  not  preserved  in  the  resulting  Bezier 
curves.  A  more  careful  geometric  analysis  of  Quaternion  curves  is  given  by  Ge  and  Ravani 
(1994a,  1994b),  in  which  the  underlying  curved  geometry  of  the  space  of  quaternions  was 
considered  in  performing  the  interpolation  and  actual  Euclidean  motions  were  generated 
with  a  proper  analysis  and  evaluation  of  the  characteristics  of  the  resulting  Bezier  represen¬ 
tations.  Jutler  (1994)  has  presented  a  similar  investigation  working  with  dual  quaternion 
curves  and  discussing  some  of  the  issues  associated  with  the  dependence  of  the  existing 
methods  on  coordinate  system  used.  Hart,  Francis  and  Kauffman  (1994)  have  .presented 
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an  interesting  method  for  visualization  of  quaternion  curves  representing  three  dimensional 
rotations. 

The  unit  quaternions,  in  fact,  can  be  identified  with  the  Lie  group  SU(2)  of  the  2x2 
special  unitary  matrices,  so  that  the  methods  described  here  can  be  extended  in  an  straight 
forward  manner  to  SU(2).  The  present  circle  of  geometric  ideas  has  also  been  applied  to 
formulate  Bezier  curves  on  SO(3)  and  general  compact  Lie  groups  (Park  and  Ravani  1995). 
Other  relevant  work  in  motion  interpolation  include  the  work  of  Barr  (1993),  Wagner  and 
Jutler  (1994),  Pottmann  and  Wagner  (1993). 

None  of  these  authors,  however,  have  considered  the  problem  of  motion  design  in  the 
general  frame  work  of  minimizing  a  certain  measure  of  smoothness  on  SO  (3).  Furthermore, 
they  have  not  utilized  Riemanian  geometry  to  deal  with  the  inherent  curved  nature  of  the 
underl3dng  space  of  three  dimensional  rotations  producing  interpolated  motions  that  are 
completely  coordinate  independent. 

This  paper  builds  upon  the  formulation  presented  in  Park  and  Ravani  (1995);  but  de¬ 
velops  a  variational  method  for  design  of  cubic  sphnes  for  interpolating  multiple  points  in 
SO(3)  generating  smooth  multi-segment  rotations  which  are  independent  of  the  choice  of 
the  coordinate  system. 

2  The  Geometry  of  SO  (3) 

We  begin  with  a  review  of  the  necessary  background  on  SO(3)  as  a  matrix  Lie  group;  the 
development  closely  parallels  that  of  (Park  and  Ravani  1994),  and  additional  background 
can  be  found  in,  e.p.,  (Belinfante  and  Kohnan  1972). 

SO(3)  as  a  Lie  Group 

Recall  that  SO(3)  is  the  set  of  all  3  x  3  real  orthogonal  matrices  with  unit  determinant. 
SO(3)  has  the  structure  of  a  group  and  a  differentiable  manifold,  and  is  an  example  of  a 
Lie  group.  The  rigid-body  motions  SE(3)  can  also  be  regarded  as  a  Lie  group  under  matrix 
multiplication,  with  elements  of  the  form 

R  b 
0 
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1 


(2) 


where  R  €  S0(3)  and  b  €^.  Some  other  well-known  examples  of  matrix  Lie  groups  include 
Gl(n),  the  general  linear  group  of  n  x  n  real  nonsingular  matrices,  and  the  special  linear 
group  Sl(n),  which  is  a  subgroup  of  Gl(n)  whose  elements  have  unit  determinant. 

More  generally  let  G  denote  a  matrix  Lie  group,  and  let  X  (t)  be  a  differentiable  curve 
in  G  defined  over  some  open  interval  containing  0  such  that  X  (0)  =  p.  The  derivative  X  (0) 
is  said  to  be  a  tangent  vector  to  G  at  p;  the  set  of  all  tangent  vectors  at  p,  denoted  TpG, 
forms  a  vector  space,  called  the  tangent  space  to  G  at  p.  The  tangent  space  at  the  identity 
p  =  7  is  given  a  special  name,  called  the  Lie  algebra  of  G,  and  denoted  by  the  lower-case  g. 
On  SO  (3)  it  is  easily  seen  that  the  Lie  algebra  so(3)  consists  of  the  3x3  skew-symmetric 
matrices:  if  R{t)  is  a  ciurve  in  SO(3)  such  that  R{G)  —  I,  then  differentiating  both  sides  of 
R'^{t)R{t)  =  7,  it  follows  that  .R^(O)  -t-  .R(O)  =  0,  so  that  elements  of  so(3)  are  matrices  of 


the  form 

I 


0 

-rz 

r2 

rs 

0 

-ri 

-r2 

ri 

0 

(3) 


where  r  €  3?^.  Note  that  an  element  [r]  €  so  (3)  can  also  be  represented  as  a  vector  r  € 
if  it  is  clear  firom  the  context  which  representation  is  meant  then  an  element  of  so(3)  will 
simply  be  denoted  as  r. 

More  generally  a  Lie  algebra  is  a  vector  space,  V,  together  with  a  bilinear  map  [•,  •]  : 
V  X  V  — >  V  (called  the  Lie  bracket)  that  satisfies,  for  every  V,  (i)  [77,77]  =  0,  and 

(ii)  [77,  [p,^]]  +  [^,  [77,  pj]  +  [p,  [^,77]]  =  0.  From  (i)  and  the  bilinearity  property  it  follows 
that  [77,  p]  =  —  [p,77].  For  matrix  Lie  groups  and  Lie  algebras  the  corresponding  Lie  bracket 
reduces  to  the  standard  matrix  commutator:  if  77  and  p  are  square  matrices,  then  [77,  p]  = 
77P  —  fjLT].  In  particular,  on  so(3)  it  is  easily  verified  that  the  Lie  bracket  of  two  elements 
corresponds  to  their  vector  product:  [ri,r2]  =  [^ijH]  —  NjiT*!]  =  [^'i  x  ^2]- 


The  Exponential  Mapping 

An  important  connection  between  a  Lie  group  and  its  Lie  algebra  is  the  exponential  mapping-, 
defined  on  each  Lie  algebra  is  the  exponential  mapping  into  the  corresponding  Lie  group. 
On  matrix  groups  the  exponential  mapping  is  given  by  the  usual  matrix  exponential,  i.e., 
if  A  is  an  element  of  the  Lie  algebra,  then  exp  A  =  7-|-A-t-^-i-...is2m  element  of  the  Lie 
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group.  On  so(3)  the  exponential  mapping  is  onto,  i.e.,  for  every  R  G  SO  (3)  there  exists  an 
[r]  G  so(3)  such  that  exp[r]  =  R.  On  SO(3)  and  its  Lie  algebra  well-known  explicit  formulas 
exist  for  the  exponential  and  its  inverse:  if  [r]  G  so(3),  then 


exp[r]  —  I  + 


sm  n 


M  + 


1  —  cos  llrl 


[r]^ 


(4) 


where  ||r||  is  the  standard  Euclidean  norm.  Alternatively,  if  iJ  G  SO  (3)  such  that  Tr(i?) 
—1,  then 


(5) 

2  sm  (p 

where  (j)  satisfies  1  -t-  2  cos  4>  =  Tr(-R)  and  ||  log  i?|p  =  4?'.  In  the  case  when  Tr(i?)  =  — 1  two 
possible  solutions  for  log  R  are  as  follows:  if  f  is  a  unit  length  eigenvector  of  R  associated 
with  the  eigenvalue  1,  then  logii  =  ±7r[r]. 

Prom  the  above  formulas  SO(3)  can  be  visuahzed  as  a  solid  ball  of  radius  tt,  centered  at 
the  origin  with  the  antipodal  points  identified;  a  point  r  in  the  ball  represents  a  rotation  by 
an  angle  ||r||  about  the  line  passing  from  the  origin  through  r.  Conversely,  any  R  G  SO(3) 
can  be  represented  by  the  set  of  points  corresponding  to  log  R.  Note  that  this  representation 
is  unique  when  restricted  to  the  interior  of  the  solid  ball.  In  general,  if  [r]  is  one  solution 
to  log  R,  then  R  =  ^  for  any  integer  k.  The  exponential  mapping  provides  a  set 

of  local  coordinates  for  a  Lie  group  over  a  neighborhood  of  the  identity;  Chevalley  (1946) 
calls  these  coordinates  the  canonical  coordinates  (of  the  first  kind).  On  SO(3)  we  see  that 
the  canonical  coordinates  are  obtained  from  the  logarithm  formula. 

The  Lie  algebra  so(3)  can  be  viewed  as  providing  local  coordinates  for  SO(3)  via  the 
exponential  map.  Another  iiseful  interpretation  involves  angular  velocities.  If  R{t)  is  a  curve 
in  SO  (3)  describing  the  orientation  of  a  rigid  body  relative  to  an  inertial  reference  frame, 
then  it  is  not  difficult  to  see  that  both  RR~^  and  R~^R  are  skew-symmetric,  and  therefore 
elements  of  so(3).  R~^R  is  in  fact  the  angular  velocity  of  the  rigid  body  in  body-fixed  frame 
coordinates,  whereas  RR~^  is  the  angular  velocity  in  inertial  frame  coordinates. 


SO(3)  as  a  Riemannian  Manifold 

Let  A4  be  a  Riemannian  manifold^  of  dimension  n,  with  local  coordinates  {xi,X2,  ■  ■  •,Xn), 
and  Riemannian  metric  ds^  =  gij{x)dxidxj.  If  a  curve  on  M  is  given  in  local  coordinates 

^See  (Gallot  et  al  1990)  for  a  comprehensive  introduction  to  Riemannian  manifolds. 
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by  x(t),  0  <  t  <  1,  then  the  length  of  the  curve  is  given  by  the  integral 

Just  as  a  line  in  Euclidean  space  can  be  considered  as  the  shortest  path  between  two  points, 
on  a  Riemannian  manifold  the  minimum  length  curve  joining  two  points  can  be  regarded 
as  the  analog  of  the  straight  line.  However,  instead  of  L  one  usually  considers  the  energy 

p) 

^■>3 

The  curves  that  minimize  E  are  called  minimal  geodesics.  It  can  be  shown  that  the  curves 
minimizing  E  also  minimize  L,  and  are  automatically  parametrized  according  to  arc-length. 
In  local  coordinates  the  minimal  geodesics  must  satisfy  the  system  of  differential  equations 

d^Xk  pfc  dxi  dxj _ 

di2  ■^4'  dt~ 

for  fc  =  1, 2, . . . ,  n,  where 


pfc  _  1  ki(^9ii  ,  9gji  _  dgij . 


(9) 


and  =  {9ki)~^- 

Clearly  the  minimal  geodesics  depend  strongly  on  the  choice  of  Riemannian  metric. 
In  general  one  cannot  hope  to  find  a  “natural”  Riemannian  metric  for  a  given  manifold, 
in  the  sense  that  the  metric  is  determined  by  the  geometry  of  the  space.  However,  on 
compact  Lie  groups  such  as  SO  (3)  there  does  exist  a  natural  metric  determined  by  the 
requirement  of  bi-invariance.  Recall  from  earlier  that  both  R~^jk  =  [cu;,]  and  RR~^  =  a;^] 
are  elements  of  so(3)  that  correspond  to  the  angular  velocity  in  body-fixed  and  inertial 
frame  coordinates,  respectively.  Since  any  tangent  vector  R  can  be  identified  with  an 
element  of  so(3)  by  either  left  or  right  translation,  any  inner  product  on  so(3)  defines  two 
distinct  Riemannian  metrics  on  SO(3).  Let  this  inner  product  be  given  by  the  symmetric 
positive-definite  quadratic  form  Q.  The  left-invariant  Riemannian  metric  induced  firom 
Q  is  then  {jk,R)i  =  similarly,  the  right-invariant  Riemannian  metric  is  given 

by  {jk,R)r  =  If  then  Q  is  said  to  define  a  bi-invariant 

Riemannian  metric.  Clearly  equality  holds  if  and  only  if  Q  =  cl,  for  c  >  0  any  scalar 


constant. 


Not  all  Lie  groups  have  bi-invariant  Riemannian  metrics  (e.^.,  SE(3)),  but  for  compact 
Lie  groups  like  SO (3)  one  is  always  guaranteed  to  exist.  In  this  case  the  geodesics  (with 
respect  to  the  bi-invariant  metric)  are  the  one-parameter  subgroups  and  its  translates;  on 
SO(3),  for  example,  the  minimal  geodesic  between  Ri  and  i?2  is  given  by 

R{t)  =  ^  0  <  t  <  1  (10) 

where  [ri2]  is  the  minimum  norm  value  of  log(iJj|'^i?2),  and  [r2i]  =  i?i[ri2]ilf . 

3  Two  Point  Interpolation 

Bi-invariant  Solutions 

On  Riemannian  manifolds  second  derivatives  are  generalized  by  the  symmetric  covaxiant 

derivative  compatible  with  the  Riemannian  metric,^  denoted  by  the  symbol  V.  The  equiv- 

1 

alent  energy  functional  to  Equation  (1)  is  then  given  by 

J(x)  =  j (y  a_x,V  a_x)  dt  (11) 

J  dt  dt 

where  x{t)  denotes  the  curve  and  (*,  •)  the  Riemannian  metric.  In  local  coordinates  V  a  i 
is  just  the  left-hand  side  of  Equation  (8),  where  the  Riemannian  metric  is  given  by  gij{x). 
Since  J(i)  is  a  second-order  functional,  four  boundary  conditions  are  required  to  specify  a 
unique  solution.  In  general  the  geodesics  will  not  be  admissible  curves,  but  when  they  are 
the  integrand  vanishes,  so  that  the  geodesics  minimize  J(x), 

The  Euler-Lagrange  equations  for  J(n)  (sometimes  referred  to  as  the  equations  for 
geodetic  deviation)  are 

V^a  a: -1- i?(V  8  x,  x)(x)  =  0  (12) 

(Noakes  et  al  1989,  Milnor  1969)  where  R  is  the  Riemannian  curvature  tensor  of  V.  These 
equations  are  quite  complex  when  expressed  in  local  coordinates.  For  matrix  Lie  groups 
with  a  left-  or  right-invariant  Riemannian  metric  it  is  often  more  convenient  to  derive  the 
equations  directly  from  the  first-order  necessary  conditions.  Specifically,  let  G  and  g  be  the 
matrix  Lie  group  and  its  corresponding  Lie  algebra,  respectively,  and  (•,  •)  an  inner  product 

^Again,  see  (Gallot  et  al  1990)  for  a  complete  discussion  of  covariant  derivatives. 
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on  g  defining  a  left-invariant  metric.  The  objective  then  is  to  find  a  curve  U(t)  in  g  that 
minimizes 

JiU)=  [\u,U)dt  (13) 

Jo 

subject  to 

X{t)=Xit)U{t)  (14) 


with  X(0),  .X’(l),  U{0),  and  J7(l)  given.  (If  the  right-invariant  metric  were  used  instead  the 
constraint  would  then  be  X{t)  =  U{t)X{t).) 

On  SO  (3)  the  smoothness  functional  with  respect  to  the  bi-invariant  metric  turns  out 
to  be  the  integral  of  the  squared  Euclidean  norm  of  the  angular  acceleration.  In  order  to 
express  the  functional  in  terms  of  the  canonical  coordinates  the  following  result  is  needed. 
Let  R{t)  be  a  curve  in  SO(3)  parametrized  in  canonical  coordinates  by  R(t)  =  i?exp[^(t)], 
where  ^{t)  is  a  curve  in  ^  and  R  €  SO(3)  is  some  given  constant.  It  can  be  shown  that 
the  angular  velocity  in  body  coordinates,  denoted  is 

R-^  {t)R{t)  =  [a)(i)]  =  ds  (15) 

Jo 


which  can  be  further  simplified  to  the  vector  equation  a;(t)  =  where 


ll«ll= 


ii«r 


(16) 


The  smoothness  functional  in  canonical  coordinates  is  then 

AO  =  ll|(A({){)f*  (17) 

We  now  show  that  the  solutions  that  minimize  J(4)  are  “invariant”  (in  a  sense  to  be 
made  precise  below)  with  respect  to  right-  and  left-translations  of  the  boundary  values. 
This  result  is  not  surprising  considering  that  J(^)  is  defined  in  terms  of  the  bi-invariant 
Riemannian  metric  on  SO(3).  Nevertheless,  the  actual  calculations  turn  out  to  be  useful 
for  deriving  the  form  of  the  cubic  sphne  solution.  In  what  follows  it  may  be  helpful  to  bear 
in  mind  the  actual  engineering  problem  being  addressed,  which  is  to  interpolate  a  smooth 
orientation  trajectory  for  a  rigid  body  between  two  given  orientations,  subject  to  angular 
velocity  constraints  at  both  endpoints.  Suppose  that  inertial  and  body-fixed  reference 
firames  have  been  chosen.  The  interpolation  problem  can  then  be  stated  mathematically 
as  follows:  find  a  curve  R(t)  in  SO(3)  that  minimizes  J{^),  while  satisfying  the  boundary 
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conditions  R{0)  =  R^,  R{1)  =  R^,  R-^(0)R{0)  =  [wo],  and  ^-^(1)^(1)  =  [wi],  where 
RoiRi  €  SO(3),  wo,wi  €  axe  given.  For  convenience  we  parametrize  the  admissible 
curves  according  to  R{t)  =  where  ^(t)  is  a  curve  in  ^(t)  must  then  satisfy 


(i)  4(0)  =  0  (18) 

(ii)  [4(1)]  =  log(i?o  ^i?i)  (19) 

(in)  4(0)  =  Wo  (20) 

(iv)  ^(4(1))4(1)  =wi  (21) 


where  A(4)  is  as  given  in  Equation  (16).  If  now  a  different  body-fixed  firame  is  chosen, 
the  boundary  values  Rc  and  Ri  are  then  right-translated  by  some  0  €  SO  (3)  to  .^  = 
RqQ  and  Ri  =  RiQ,  respectively,  while  the  velocity  vectors  wo  and  wi  are  transformed 
to  ©~^wo  and  0“^wi,  respectively.  The  new  curve  R{t)  =  must  then  minimize 


Jii)  =r  So  ll^(^(f)4)lPrf*  while  satisfying 

(i)  4(0)  =  0  (22) 

(ii)  [4'(1)]  =  iog(.Ro '-Ri)  =  log(e-^R^^Rie)  =  0-M4(i)]e  (23) 

(Hi)  4(0)  =  ©“^Wo  (24) 

(w)  A(4(l))4(l)  =  0-iwi  (25) 


If  we  consider  4(*)  to  be  of  the  form  0~^4(t))  then  the  above  boundary  conditions  are 
identical  to  those  of  Equations  (18)-(21).  Moreover,  since  A(4)  =  0“^A(4)0,  it  follows  that 
A(4)4  =  ©“^A(4)4,  or  «^(4)  =  J(0-  Therefore,  if  4(t)  is  a  solution  to  the  original  variational 
problem,  then  4(t)  =  0~^4(t)  is  a  solution  to  the  latter.  Using  standard  matrix  exponential 
identities  one  can  now  show  that  R(t)  =  =  ilo0e®~^tC(t)]©  =  =  R(t)Q 

as  claimed. 

In  a  similar  fashion  one  can  show  that  if  Rq  and  Ri  are  left-translated  by  some  constant 
0  €  SO (3)  to  0i2o  and  0ili,  respectively  (corresponding  to  a  change  in  the  inertial  frame), 
then  the  new  solution  is  QR(t),  the  left-translate  of  the  original  solution.  Observe  that  the 
geodesics  on  SO  (3)  are  special  cases  when  4(^)  is  linear  in  t. 
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3.1  A  Cubic-Spline  Solution 

For  interactive  applications,  solving  the  above  two-point  boundary  value  problem  is  not 
practical.  However,  if  the  two  endpoints  are  assumed  reasonably  close  to  one  another  then 
the  solution  curve  simplifies  to  a  cubic  polynomial  in  canoniceJ  coordinates.  Specifically,  let 
Ro  and  i?i  denote  the  endpoints,  and  assume  ||  log(i2^^iii)||  is  small.  Then  the  interpolating 
curve  R{t)  =  will  be  such  that  ^(t)  can  also  be  assumed  of  small  magnitude,  and 

^(0  ^  T .  Therefore  the  smoothness  functional  of  Equation  (17)  can  be  approximated  by 

(26) 

Jo 

whose  solutions  are  clearly  cubics.  SO(3)  curves  whose  image  in  the  canonical  coordinates 
are  cubics  will  be  referred  to  as  cubic  spUnes  in  SO(3). 

The  cubic  sphne  solution  to  the  two-point  interpolation  problem  on  SO(3)  is  as  follows. 
Given, the  boundary  conditions  i?(0)  =  Rq,  i?(l)  =  Ri,  R~^{0)R{0)  =  [wq],  il~^(l)jR(l)  = 
[tJi],  the  solution  curve  is  R{t)  =  ,  where  a,b,c€^  are  constants  satisfying 

•  a  +  b  +  c  =  €,  where  [e]  =  log(i?o  ^-Ri) 

•  c  =  cuo 

•  A{€)(3a  +  2b  +  c)  =  o^i,  where  A{e)  —  I  —  Ho¬ 

using  calculations  analogous  to  those  of  the  previous  section,  it  can  be  shown  that  the  cubic 
splines  are  left-invariant.  That  is,  if  R{t)  is  a  cubic  sphne  satisfying  the  above  boundary 
conditions,  then  the  cubic  spline  R{t)  that  satisfies  the  new  boundary  conditions  .R(O)  = 
QRo,  R{1)  =  ©ill,  .R“^(0)jR(0)  =  [o^o],  and  .R"^(1).R(1)  =  [wi],  is  given  by  QR{i).  Right- 
invariance  can  also  be  shown  similarly.  The  cubic  sphnes  on  SO  (3),  therefore,  are  bi- 
invariant. 

The  following  matrix  identities  are  instrumental  in  determining  the  coefficients  of  Tight¬ 
er  left-translated  cubic  splines.  First,  for  any  matrix  A,  Pe^P~^  =  Secondly,  if 

R  €  SO(3),  then  =  [Ro;]  for  any  w  €  31^.  With  these  identities  it  can  be  shown 

that  if  R{t)  =  then  R{t)Q  =  ^  ^  ^  coeflBcients  of 

the  right-translated  cubic  spline  are  given  by  d  =  0“^a,  b  =  Q~^b,  and  c  =  0“^c.  Finally, 
observe  that  when  uq  =uji,  the  cubic  splines  reduce  to  geodesics.  This  featmre,  along  with 
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the  bi-invariance  property  and  computational  efficiency,  is  what  makes  cubic  splines  an 
attractive  choice  for  interpolating  trajectories  on  SO  (3). 


4  Interpolation  of  Multiple  Points 


We  now  present  a  complete  algorithm  for  interpolating  through  multiple  points  in  SO  (3) 
using  cubic  splines.  Analogous  to  the  Euclidean  case,  the  interpolated  curve  in  SO  (3) 
maintains  continuity  of  both  angular  velocities  and  accelerations  at  the  knot  points.  The 
algorithm  we  present  requires  the  following  as  inputs:  an  ordered  set  of  n  -I-  1  rotation 
matrices  •  •  j jRn}  (the  knot  points),  a  set  of  n  +  1  scalars  to  <  ti  <■■•<  tn 

(the  knot  times),  an  initial  angular  velocity  ljq  €  and  an  intial  angular  acceleration 
Qo  €  Both  ujQ  and  ao  are  expressed  in  body-fixed  reference  frame  coordinates.  This 
set  of  inputs,  while  a  slight  departure  from  the  usual  set  for  Euclidean  cubic  splines,  is 
chosen  for  convenience;  for  different  inputs  {e.g.,  specifying  final  velocities  rather  than 
initial  accelerations)  the  corresponding  algorithm  can  be  derived  using  results  from  the 
following  analysis. 

The  interpolated  curve  is  of  the  form 


Ri{t) 


m  =  { 


(  Rn(t) 


to  <t<ti 


tn-l<t<tn 


(27) 


where 


(28) 


Here  Oj,  6i,  Ci,  i  =  1, . . . ,  n,  are  constant  vectors  in  31^  that  are  determined  using  the  following 
formulas  for  the  angular  velocity  and  acceleration.  The  angular  velocity  in  body-fixed 
coordinates  is,  from  Equation  (16), 


a;(t)  =  { 


cji{t)  =  Ai{t)ii{t),  to<t<ti 


^n(t)  —  A,i(t)^ji(t),  tn— 1  ^  t  <  tn 


(29) 


where 


Aiit)  =  7- 


cos||^»(t)|| 

ll^i(i)P 


Ut)]  + 


||^i(t)||  -  ||sin^j(t)|| 


[^i(i)]' 


(30) 
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The  angular  acceleration  is 


where 


a{t)  =  { 


ai{t)  =  to<t<ti 


OlW  =  S  - -1^(003 1|?||-||«||  sin  ||{||-l)«x{)-l^g}a({x  I) 
+  P(3smM||  -  ||{||cos||{||  -  2||e||)(e  X  K  X  j)) 

+  “y^KxKxa  +  fxKxf)) 

Here  (•,  •)  denotes  the  Euclidean  norm  in  and  are  as  above,  and 

^i(t)  =  doi  +  2&i 


(32) 


(33) 


(34) 


The  iilitially  known  quantities  are,  in  addition  to  the  knot  points  and  knot  times,  a;i(to)  =  ^0 
and  ai(to)  =  chq-  Prom  these  initial  conditions  ai,6i,  and  Ci  can  be  uniquely  determined, 
from  which  wi(ti)  =  wi  and  Q!i(ti)  =  ai  can  in  turn  be  determined.  By  the  continuity 
requirements  wi(ti)  =  ^2(^1)  and  ai(ti)  =  Q:2(ii))  the  vector  coefficients  02,  b2,  and  C2  can 
now  be  determined.  This  procedure  is  repeated  until  all  the  coefficients  Oi,  bi,Ci,i  =  l,...,n 
have  been  found.  We  now  present  the  complete  algorithm. 


•  Given: 

{i2o,  i?!, .  •  • ,  Rn}  =  knot  points 
{tO)  ti)  •  •  • )  fn}  =  knot  times 

uq  =  angular  velocity  at  to  in  body-fixed  coordinates 
ao  =  angular  acceleration  at  to  in  body-fixed  coordinates 

•  Preprocessing:  for  i  =  1  to  n  find 

[ci]  =  log{Rf_^Ri) 

A  _  r  cosllcillr,  !  ,  l|€j||-||sin€<||  r.  i2 

||4P  (^*1 


•  Initialization: 


•  Recursion:  for  i  =  2  to  n  do 


®  =  ikill  (temp,  var.) 

t  =  3ai_i  +  26i_i  4-  Ct_i  (temp,  var.) 

u  =  6ai_i  +  26j_i  (temp,  vax.) 

a  =  Ai_iCi_i 

+1^(3  sin  l|s||  —  ||s||cos||s||  —  2||s|t)(s  x  («  x  t)) 
+M  jM(tx(sxi)  +  »x(»x«))) 

Ui  —  €{  hi  Ct 


•  Result:  for  tf-i  <  t  <  ttj 


ii(t)  =  /!,-i  expla,  (jrlij)'  +  (^Z^ 


The  ej  are  found  from  the  log  formula  of  Equation  (5).  In  cases  where  €{  has  two  possible 
values  (corresponding  to  the  two  antipodal  points  on  the  sphere  of  radius  tt),  either  value 
will  still  generate  the  same  orientation  trajectory  in  SO(3).  The  interpolated  curve  R{t)  is 
then  evaluated  using  the  exponential  formula  of  Equation  (4). 


Example 

In  this  section  we  provide  a  simple  example  to  illustrate  the  utUity  of  the  interpolation 
technique  developed  in  this  paper.  Figure  1  shows  several  positions  of  an  end  effector  of  a 
robot  manipulator.  These  positions  are  used  as  control  positions  and  a  cubic  interpolation 
is  performed  on  SO(3)  to  generate  the  motion  of  the  end  effector  depicted  in  Figure  2.  We 
have  separately  interpolated  the  orientations  from  the  position  of  a  point  on  the  end  effector. 
The  techniques  presented  in  this  paper  are  used  for  the  interpolation  of  the  orientation  parts 
of  the  trajectory. 


5  Conclusions 

By  viewing  SO(3)  as  a  Lie  group  equipped  with  a  natural  Riemannian  metric,  we  have 
presented  an  algorithm  for  interpolating  through  multiple  points  in  SO  (3)  that  can  be  in¬ 
terpreted  as  a  type  of  generalized  cubic  spline  in  rotation  space.  The  main  advantage  of 
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this  approach  is  that  bi-invariant  curves  can  now  be  generated  in  a  computationally  efficient 
way.  In  the  context  of  moving  rigid  bodies,  bi-invariance  ensures  that  the  orientation  trajec¬ 
tories  are  independent  of  choice  of  inertial  or  body-fixed  reference  frames.  Rotational  cubic 
splines  are  also  an  effective  compromise  between  the  computational  needs  for  interactive 
CAD  versus  greater  curve  smoothness. 

In  extending  the  rotational  cubic  sphne  techniques  to  the  interpolation  of  general  mo¬ 
tions  for  rigid  bodies,  several  additional  issues  need  to  be  addressed.  First,  it  is  a  well-known 
classical  result  that  SE(3),  the  Lie  group  of  rigid-body  displacements,  does  not  admit  a  bi¬ 
invariant  Riemannian  metric.  One  physical  consequence  of  this  fact  is  that  there  is  no 
interpolation  scheme  that  is  bi-invariant;  if  one  were  to  imagine  infinitely  large  rigid  bodies, 
then  any  method  of  motion  interpolation  will  ultimately  depend  on  the  choice  of  inertial  or 
body-fixed  reference  frame.  It  is  possible,  however,  to  relax  the  requirement  of  bi-invaxiance, 
in  which  case  left-  or  right-invariant  motions  can  be  generated  using  the  above  construction. 
The  most  straightforward  approach  is  to  interpolate  the  orientation  and  position  (of  some 
special  point  on  the  rigid  body)  trajectories  separately.  Alternatively,  the  exponential  and 
logarithm  mappings  on  SE(3)  and  its  Lie  algebra  (see,  e.g.,  Park  et  al  1993)  can  be  applied 
to  construct  cubic  splines  in  the  same  way  as  for  SO  (3).  The  difference  between  the  two 
approaches  is  best  illustrated  by  the  problem  of  interpolating  between  two  configurations  of 
a  rigid  body.  Assuming  the  body-fixed  frame  has  been  attached  to  the  center  of  mass,  and 
the  initial  generalized  velocities  and  accelerations  are  given,  the  former  approach  results 
in  a  linear  motion  of  the  center  of  mass.  The  latter  method,  however,  produces  a  screw 
motion  as  the  final  trajectory.  Interpolating  the  positions  and  orientations  separately  would 
therefore  seem  more  natural  from  the  point  of  view  of  dynamics. 
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Abstract 

This  paper  deals  with  calibrating  spatial  relationships 
among  CAD  models  of  objects  used  in  off  line  robot 
programming.  An  approach  based  on  a  Clifford  alge¬ 
bra  is  used  and  simple  algorithms  are  developed  for  this 
kinematic  localization  problem.  The  position  and  ori¬ 
entation  of  each  object  in  a  workcell  are  represented 
by  multivectors  of  different  ranks  and  the  localization 
problem  is  converted  into  that  of  solving  a  set  of  mul¬ 
tivector  equations.  Given  locations  of  points  in  two 
frames,  an  averaging  process  is  developed  which  yields 
the  best  orientation  as  the  root  of  four  linear  equations 
and  the  best  position  as  a  linear  combination  of  the 
best  rotation  and  the  centroids  of  two  point  systems. 
If  normal  vectors  are  also  sensed,  the  Clifford  algebra 
representation  facilitates  definition  of  a  measure  of  ori¬ 
entation  error  compatible  with  the  position  error. 

Introduction 

CAD-based  graphical  off-line  robot  programming  and 
simulation  requires  a  computer  representation  of  the 
robot  workcell  environment.  This  representation,  called 
the  world  model,  provides  the  necessary  information 
about  the  robot  workcell  for  a  task  planner  to  generate 
appropriate  robot  motion  control,  commands.  Errors 
in  task  execution  are  due  to  inaccuracies  of  the  world 
model  and  uncertainties  introduced  by  sensors  and  ac¬ 
tuators  during  execution.  The  latter  set  of  errors  . can 
be  eliminated  or  reduced  by  sensory  control.  The  first 
set  of  errors  may  be  corrected  by  calibrating  the  world 
model  before  task  execution.  This  paper  deals  with  the 
first  level  of  world  model  calibration  which  involves  cal¬ 
culation  of  location  of  coordinate  frames  attached  to  a 
CAD  model  and  is  here  referred  to  as  the  kinematic 
localization  problem. 


Calibration  of  the  world  model  first  requires  sensory 
interactions  with  the  robot  workcell.  In  the  past  there 
have  been  several  studies  related  to  sensory  monitoring 
of  the  robot  workcell  environment,  see  Grossman  and 
Taylor  (1978),  and  Ishii  et  al  (1988).  Any  of  the  sys¬ 
tems  proposed  in  these  papers  can  be  used  with  a  kine¬ 
matic  localization  algorithm  for  calibration  purposes. 
Kinematic  localization  problems  have  mostly  been  con¬ 
sidered  in  the  field  of  computer  vision  for  calculation 
of  position  and  orientation  of  objects  (see,  for  example, 
Grimson  and  Lozano-Perez  (1984),  Faugeras  and  Her¬ 
bert  (1986)  and  Gunnarsson  and  Prinz  (1987)).  These 
works  however  have  not  been  used  for  world  model  cal¬ 
ibration  and  more  importantly  have  not  completely  ex¬ 
ploited  the  geometric  nature  and  the  kinematic  struc¬ 
ture  of  the  problem.  The  resulting  algorithms  therefore 
have  suffered  from  unnecessary  nonlinearities  or  com¬ 
putational  requirements. 

In  this  paper,  a  representation  based  on  Clifford  al¬ 
gebra  is  -used  for  the  kinematic  relationship  between 
two  frames.  Clifford  algebra  was  proposed  by  Clif¬ 
ford  (1876).  For  recent  account  of  Clifford  algebra,  see 
Hestenes  and  Sobczyk  (1984),  McCarthy  (1990).  The 
Clifford  algebra  formulation  converts  the  localization 
problem  into  that  of  solving  a  set  of  multivector  equa¬ 
tions.  When  unit  vector  normals  are  also  measured, 
in  addition  to  point  measurements,  this  representation 
facilitates  definition  of  a  measure  of  orientation  error 
compatible  with  the  position  error.  More  importantly, 
it  also  facilitates  exploitation  of  the  geometric  nature 
and  kinematic  structure  between  two  point  system  mea¬ 
sured  with  respect  to  the  world  frame  and  to  the  natural 
frame.  This  results  in  an  averaging  process  which  re¬ 
duces  the  localization  problem  to  that  of  solving  a  set 
of  four  linear  equations.  The  orientation  of  an  object 
obtained  as  the  root  of  the  equations  is  shown  to  be  a 
least  squares  solution  and  the  best  position  of  the  ob- 
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ject  is  a  linear  combination  of  the  optimal  orientation 
and  the  centroids  of  the  point  systems  in  two  frames. 


1  The  Clifford  Algebra 

The  Clifford  algebra  C(R^)  of  a  3-dimensional  vector 
space  (R^)  is  generated  from  elements  of  the  space  by 
a  product  operation  called  the  geometric  product  This 
product  is  linear  and  associative,  and  in  addition,  the 
products  of  basis  vectors  of  R^  satisfy  the  following 
properties  (see  McCarthy  1990  and  Ge  1990): 


— 


for  m  ^  n, 
1  for  m  =  n, 


(1) 


where  ^  =  1>2,3,  are  the  basis  vectors  of  R^  such 
that  Cm  has  a  1  in  the  position  and  zeroes  else¬ 
where. 

The  geometric  product  em^n  is  not  a  vector  nor  a 
scalar  but  a  new  entity  called  rank  2  muliivecior  or 
bivecior.  In  Clifford  algebra,  a  scalar  is  considered  as  a 
rank  0  muliivecior  and  a  vector  is  a  rank  1  muliivecior. 
The  geometric  product  of  all  three  basis  vectors 


where  q*"  =  (-^i , -^^2, -^3’ ?4)  is  the  conjugate  of  q 
and  [,c“],  [x"^]  are  4  x  4  skew  symmetric  matrices  given 
by 


0 
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Xi 
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-X3 

0 
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0 

The  product  of  these  two  matrices  commutes,  i.e. 
[x"][x'^]  =  In  addition,  we  have  = 

[x”][x“]  =  ““|xp[/]  where  |x|“  is  the  length  square  of 
the  vector  x  and  [/]  is  the  4x4  identity  matrix. 

It  can  also  be  shown  that  the  product  of  three  ele¬ 
ments,  X  (a  vector),  q  (an  element  of  C“‘"(R^))  and  y 
(another  vector)  can  be  expressed  in  matrix  form  as 

xqy  = (6) 

where  [y~]  is  obtained  from  [x~]  by  replacing  x  with 

y- 


i  =  616263,  (2) 

is  a  rank  3  muliivecior  or  irivector.  It  has  the  property 
that  =  —  1  and  commutes  with  every  vector  in  the 
space.  Furthermore,  it  facilitates  representation  of  basis 
bivectors  in  terms  of  basis  vectors: 

6263  =  i6i,  6361  =:  ie2,  6162  =  ies.  (3) 

The  geometric  product  of  two  general  vectors,  x  = 
xiei  4-  ^262  +  0:363  and  y  =  yi6i  +  2/2^2  +  2/3^3,  is  a 
combination  of  a  scalar  and  a  bivector: 

xy  =  x-y-{-2'(xxy),  (4) 

where  x  -  y  is  the  vector  scalar  product  and  x  x  y  is  the 
vector  cross  product. 

A  general  element  of  the  Clifford  algebra  of  R^  is  a 
combination  of  multivectors  with  ranks  from  0  to  3.  It 
includes  a  scalar  term,  a  vector  term  with  three  basis 
vectors  em,  tti  —  1,2,3,  a  bivector  term  with  three  ba¬ 
sis  bivectors  26^,  and  a  trivector  term  i  =  616063.  The 
set  of  elements  of  as  combinations  of  scalars  and  bivec¬ 
tors  constitutes  the  even  subalgebra  of  C(R^).  The  set 
of  elements  of  as  combinations  of  vectors  and  tri vec¬ 
tors  constitutes  the  odd  subalgebra.  The  conjugates 
for  these  elements  are  obtained  by  replacing  i  with  —i. 

The  product  of  a  vector  x  =  xiei  -h  X2e2  +  ^363  with 
an  even  element  q  =  2^161  -f  iq2^2  +  can  be 

expressed  in  matrix  form  as 

xq=[x~]q*,  qx=[x+]q*, 


2  Kinematic  Relationships  in 
the  World  Model 

A  world  model  usually  includes  geometrical,  relational 
as  well  as  physical  descriptions  of  the  workcell.  The 
world  model  EWORLD  proposed  by  Ravani  (1988)  uses 
a  multi-primitive  representation  of  the  workcell  environ¬ 
ment  at  an  abstract  level.  Frame  primitive  is  used  to 
mark  the  position  and  orientation  of  a  fixed  reference 
frame  in  space  (called  the  world  frame).  It  can  be  used 
to  mark  the  location  of  an  object  or  device,  in  which 
case  they  are  referred  to  as  natural  frames.  It  can  also 
be  used  to  mark  feature  locations  on  objects  or  devices, 
in  which  case  they  are  referred  to  as  auxiliary  frames. 

This  section  identifies  the  spatial  relationship  be¬ 
tween  a  natural  frame  and  a  world  frame  (or  between  a 
auxiliary  frame  and  a  natural  frame)  with  an  element  of 
the  Clifford  algebra  of  R^.  The  Clifford  algebra  repre¬ 
sentation  is  closely  related  to  the  quaternion  representa¬ 
tion  of  transformations,  see  Bottema  and  Roth  (1979), 
McCarthy  (1990)  and  Ge  (1990). 

The  kinematic  relationship  between  two  frames  con¬ 
sists  of  an  orientation  relationship  and  a  positional  re¬ 
lationship.  The  orientation  relationship  is  represented 
by  the  following  even  element  (Ge  1990): 

q  =  2ssin(^/2)  4-cos(f?/2),  (7) 

where  s  =  Sj^ei  4-5^62+5^63  is  the  unit  vector  along  the 
axis  of  rotation  and  0  the  angle  of  rotation.  The  four 
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components  of  q  =  (91,  g2. 43, 94)  given  by  (7)  satisfy 
the  relation  ^  ,  , , , 

9i  +  92  +  ^3  +  94  =  i’, 

and  are  called  the  Euler  parameters  of  rotation. 

If  the  location  of  the  origin  of  one  frame  relative  to 
another  is  known,  say  it  is  specified  by  the  vector  d  - 
diBi  +  (^262+^363,  then  the  positional  relationship  is 
represented  by  the  following  geometric  product 

q°  =  (l/2)qd,  (9) 

where  q  is  the  orientation  defined  by  (7).  In  general, 
the  positional  relationship  is  given  by 

q°  =  (l/2)(qx' -  xq),  (10) 

where  x  =  ziei  +  Z2®2  +  *^363  S'!!*!  + 

2363  are  the  coordinate  vectors  of  a  point  measured 
relative  to  the  natural  frame  and  to  the  world  frame,  q 
is  an  element  of  the  odd  subalgebra,  i.e.  a  combination 
of  a  vector  and  a  trivector. 

Thus  the  pair  of  elements,  (q,q°),  uniquely  deter¬ 
mine  the  kinematic  relationship  between  two  frames. 
Note  that  the  pair  (-q,  -q°)  represents  the  same  re¬ 
lationship  as  (q,q°).  Furthermore,  the  definition  of  q 
implies  that  the  components  of  q  and  q°  satisfy 

-  9i9i  -  9292  “  9393  +  949°  =  0.  (11) 

For  orientation  relationship,  the  element  q°  =  0  since 
d  =  0.  Therefore,  the  pair  of  unit  vectors  (u  ;  u') 
represents  the  direction  of  a  line  in  both  frames  satisfy 

qu'  —  uq  =  0.  (12) 


be  t^iven  by  two  pairs  of  unit  vectors  (u  ;  u')  and  (v  ;  v 
where  u,  v  are  measured  relative  to  the  object  s  natura 
frame  and  u',  v'  are  measured  relative  to  the  world 
frame.  Each  pair  satisfies  (12),  i.e. 

qu'  —  uq  =  0,  (1*^) 

qv'  —  vq  =  0.  (1‘1) 

The  substitution  of  q  =  issin(0/2)  +  cos(^/2)  into 
.  (13)  yields,  after  some  algebra 

sin{6i/2)s  X  (u'  -b  u)  -  cos(0/2)(u'  -  u) 

—i  sin(0/2)s  •  (u'  —  u)  —  0. 

Eq.(15)  separates  into  a  vector  equation 

sin(0/2)s  X  (u'  +  u)  -  cos(0/2)(u'  -  u)  =  0.  (16) 

and  a  scalar  equation  sin(0/2)s  ■  (u'  -  u)  =  0.  The 
vector  equation  is  the  well-known  Rodrigues  equation 
and  the  scalar  equation  indicates  that  the  rotation  axis 
s  is  perpendicular  to  the  vector  (u'  -  u). 

Taking  the  cross  product  of  both  sides  of  (16)  wit 

(v'  —  v)  yields 


e  (v'  -  v)  X  (u'  -  u) 
p  =  tan(-)s  = 


Note  that  in  obtaining  (17)  we  have  used  the  fact  l,hat 
5  .  (v'  —  v)  =  0  which  can  be  derived  from  (14),  see 
also  Bottema  and  Roth  (1979).  Thus  the  orientation 
relationship  is  specified  by 


3  Minimal  Solution 

To  calibrate  the  kinematic  relationship  between  two 
frames  for  CAD  models  is  to  determine  the  real  trans¬ 
formation  from  one  frame  to  another.  In  the  Clifford 
algebra  representation,  it  is  to  determine  the  pair  of 
elements  (q,q°)  corresponding  to  the  transformation. 
This  section  focuses  on  calibrating  the  kinematic  re¬ 
lationship  between  an  object’s  natural  frame  and  the 
world  frame.  We  first  discuss  the  minimum  informa¬ 
tion  required  for  the  calibration  with  the  assumption 
that  the  sensory  information  about  geometric  features 
in  both  frames  are  available.  We  then  provide  an  algo¬ 
rithm  for  calibration  with  point  measurements  only  in 
the  world  frame. 

3.1  Minimum  Sensory  Information 

To  determine  the  orientation  relationship  q,  directions 
of  two  distinct  lines  are  required.  Let  the  two  directions 


where  jpj  is  the  length  of  the  vector  p. 

To  obtain  the  positional  relationship  q  ,  the  location 
for  one  point  of  the  object  is  required.  Let  (x  :  x  )  be  a 
pair  of  vectors  for  the  locations  of  a  given  point  relative 
to  the  two  frames.  Then  q°  is  given  by  (10). 

For  point  measurements,  a  minimum  number  of  three 
non-collinear  points  are  required  to  determine  both  q 
and  q°.  This  is  evident  by  the  fact  that  two  direction 
vectors  can  be  constructed  to  determine  q  and  one^of 
the  three  points  can  be  used  to  further  determine  q  . 

3.2  Calibration  with  Information  only 
in  the  World  Frame 

Given  a  set  of  point  locations  in  the  world  frame,,  any 
transformation  (q,q'’)  can  be  used  to  map  these  loca¬ 
tions  to  the  natural  frame.  Therefore,  the  uniqueness 
of  the  relationship  (q,q°)  is  not  a  meaningful  topic  ot 
discussion.  The  focus  here  is  on  how  to  sekct  a  natu¬ 
ral  frame  for  a  given  set  of  point  locations  in  the  worl 
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frame.  We  have  obtained  formulas  for  uniquely  deter¬ 
mining  (q,  q^)  in  terms  of  locations  of  the  minimum 
number  of  points  measured  in  both  the  natural  frame 
and  the  world  frame.  These  formulas  can  be  regarded 
as  defining  (q,q®)  as  functions  of  the  unknown  point 
locations  in  the  natural  frame.  The  number  of  the  un¬ 
knowns  can  be  considered  as  the  “degree  of  freedom” 
of  the  calibration  system  which  allows  us  to  select  a 
specific  natural  frame.  In  what  follows  we  propose  a 
special  choice  of  natural  frame  for  a  given  set  of  point 
measurements. 

Three  non-collinear  points,  x,  y  and  z  measured  rel¬ 
ative  to  the  world  frame,  determines  a  plane.  The  cross 
product  of  two  vectors  on  this  plane,  u  =  z  —  x  and 
V  =  y  —  X,  yields  the  unit  normal  for  the  plane: 

U  X  V 

W  =  -j - r. 

|u  X  v| 

We  select  the  orientation  of  the  natural  frame  such 
that  its  z-axis  (63)  is  parallel  to  the  unit  normal  w  and 
its  x-axis  (ei)  is  parallel  to  the  vector  u.  Thus  the  two 
pairs  of  direction  vectors  (w,  03)  and  (u/jul,  ei)  can 
be  used  to  determine  the  orientation  q.  Furthermore, 
if  the  point  x  is  selected  as  the  origin  of  the  natural 
frame,  then  the  position  is  given  by  q°  =  (l/2)qx. 

3.3  Calibration  of  Kinematic  Relation¬ 
ship  Between  Any  Two  Frames 

To  calibrate  the  kinematic  relationship  between  any  two 
frames,  we  calibrate  first  the  spatial  relationship  of  each 
frame  relative  to  the  world  frame  and  then  derive  the 
relationship  between  the  two  given  frames'  from  it.  Let 
(qi,q5)  represent  the  location  of  one  frame  Ni  relative 
to  the  world  frame  and  (q2,q2)  represent  the  location 
of  another  frame  N2,  Figure  1.  Let  x,  Xi  and  X2  be  the 
coordinate  vectors  for  a  point  measured  relative  to  the 
world  frame,  the  frame  Ni  and  the  frame  N2,  respec¬ 
tively.  They  are  related  by  the  transformations  (10) 

as. 

qix-xjqi  =  2q?,  (18) 

q2X-X2q2  =  2q§.  (19) 

Eliminate  x  from  these  two  equations  to  obtain 

(q2qjxi  -  X2q2ql)  =  2(qSqi  -  q2(q?)*), 

The  pair  of  elements  (q,  q^)  representing  the  transfor¬ 
mation  from  Ni  to  N2  is  therefore  given  by 


Figure  1:  The  composition  of  kinematic  relationships. 

4  Least  Squares  Solution 

In  the  presence  gf  measurement  errors,  greater  accu¬ 
racy  in  determining  (q,q®)  may  be  attained  by  mea¬ 
suring  more  than  three  points.  This  section  presents 
an  averaging  process  which  filters  the  measurement  er¬ 
rors  and  yields  a  solution  that  is  symmetric  to  the  in¬ 
put  data.  The  process  exploits  the  geometric  nature 
and  kinematic  structure  between  two  measured  point 
systems  to  convert  the  kinematic  localization  problem 
to  that  of  solving  a  set  of  linear  equations.  The  solu¬ 
tion  to  the  problem  is  also  optimal  in  the  sense  that  it 
minimizes  the  squares  of  the  position  errors.  Further¬ 
more,  if  unit  vector  normals  are  also  measured  then  the 
Clifford  algebra  representation  facilitates  definition  of 
a  metric  measure  of  orientation  error  compatible  with 
the  position  error. 

4.1  Point  Measurements 

Let  n  vectors,  Xj  where  f  =  1, 2,  ♦  •  • ,  n,  represent  n  po¬ 
sitions  of  points  relative  to  object’s  natural  frame,  and 
the  measured  values  of  the  n  position  vectors  be  de¬ 
noted  as  Sci  with  i  =  1, 2,  •  •  * ,  n.  The  kinematic  local¬ 
ization  problem  is  to  obtain  the  orientation  q  and  the 
position  q®  of  the  natural  frame,  such  that  they  best 
satisfy  the  following  transformation  equations: 

qx,* -x^q- 2q°  =  0,  i=l,2,  (20) 

Sum  all  n  equations  in  (20)  and  we  have 

q°  =  (l/2)(qxc  ~  Xcq),  (21) 

where  the  vectors  Xc  and  x^  defined  by 


q  =  q2q: ,  q°  =  q^ql  -  q2(q?)*  • 


(22) 
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represent  the  centroids  of  two  systems  of  position  vec¬ 
tors,  Xj  and  Xi  where  z  =  1, 2,  •  •  • ,  n. 

After  the  substitution  of  (21)  into  (20),  the  kinematic 
localization  problem  becomes  that  of  finding  the  rota¬ 
tion  q  that  best  satisfies  the  n  equations 

q(xi  -  xc)  -  (xi  -  xe)q  =  0,  z  =  l,2,---,n.  (23) 

Postmultiply  both  sides  of  (23)  with  x;  to  obtain 

q|x,-  P  -  qxcXi  -  Xiqxi  +  x^qx,-  =  0.  (24) 

Premultiply  both  sides  of  (23)  with  x,-  and  we  have 

x.qxi  -  x.qxe  -  |x,  pq  +  x.-x^q  =  0.  (25) 

The  subtraction  of  (24)  from  (25)  yields 

2x,qxi  -  (xj-qxc  +  XcqXj)  +  qx^x,  +  XjX^q 

-(!i.|-  +  |x.P)q  =  0. 

Sum  all  equations  of  the  form  (25)  for  i  =  1, 2,  •  •  • ,  n 
and  in  view  of  (22),  we  obtain 

2^(x.qx.-x,qxe)  =  ^(|xip-|x,p  +  |xip-|x,p)q. 

*  =  1 

(27) 

The  best  fit  orientation  q  is  the  solution  of  the  multi¬ 
vector  equation  (27).  Using  (6),  this  equation  can  be 
put  in  the  matrix  form 

[^]q  =  Aq  (28) 

where  [A]  is  the  4x4  symmetric  matrix  given  by 

n 

•  W  =  2^(K+][x-]-[x+][z7]),  (29) 

1=1 

A  is  a  scalar  given  by 

n 

'^  =  -Z!(|x.P-  |xcp  +  |xip-  |x,p).  (30) 

1  =  1 

The  matrices  in  (29),  [xf],  [xf]  and  [xj],  [x+]  are  skew- 
symmetric  matrices  obtained  from  (5)  by  replacing  x 
with  X,-,  x,“,  Xc  and  Xc,  respectively.  The  matrix  [.4] 
is  symmetric  since  the  matrix  products  in  (29)  com¬ 
mute.  Since  q  can  not  be  a  zero  vector,  the  matrix 
[A]  “  A[/],  where  [7]  is  a  4  x  4  identity  matrix,  must  be 
singular.  This  indicates  that  A  is  also  an  eigenvalue  of 
the  symmetric  matrix  [A]  and  q  is  the  corresponding 
eigenvector. 

In  the  remainder  of  this  section  we  prove  that  the 
kinematic  relationship  (q,q®)  obtained  in  the  above 
fashion  is  also  the  least  squares  solution  to  the  set  of  n 
equations  (20),  which  can  be  put  in  matrix  form  as: 

[Pi]q*  -  2q°  =  0,  t  =  l,2,--.,n.  (31) 


where  [Pj]  is  a  4  x  4  skew  symmetric  matrix  given  by 

[P.]  =  [£+]  -  [x-].  (32) 

The  least  squares  solution  to  (31)  is  to  minimize  the 
sum  of  the  error  squares 

77 

-  V)"'([P.]q'  -  2q°),  (33) 

1=1 

by  variation  of  q*  and  subject  to  (8)  and  (11),  i.e. 

(q-)V  =  P  (q*)^q°  =  0.  (34) 

These  two  conditions,  however,  are  not  real,  constraints 
on  the  problem.  The  condition  (q*)^q^  =  0  is  implied 
by  the  skew  symmetry  of  [Pj],  and  the  optimal  solution 
(q*jq°)  is  proportional  to  the  length  \/(q*)^q*  since  q° 
is  a  linear  function  of  the  components  of  q  as  indicated 
by  (9),  Therefore,  the  localization  problem  is  reduced 
to  a  unconstrained  least  squares  problem. 

Differentiate  first  with  respect  to  q^  and  we  have 

-4f;([P]q*-2q°)  =  0. 

1  =  1 

This  leads  to 

q°  =  (l/2)[Pjq-,  (35) 

where  the  matrix  [P^]  =  n  2ir=i['R]-  is  obvious  that 
(35)  is  the  matrix  form  of  (21). 

Now  differentiate  with  respect  to  q*  to  obtain 

77 

2y3[Pf([P]q*-2q°)  =  0.  (36) 

1=1 

Substitute  (35)  into  (36)  and  we  have,  after  rewriting 
the  result  in  multivector  form: 

2^  (*i  q‘ Xi -Xoq*  Xe  )  =  ^(  |Xi  p_  |x,  p+|xf  P- |Xc  P  )q* . 
1=1  1=1 

(37) 

Taking  conjugation  on  both  sides  of  (37)  yields  the  de¬ 
sired  (27). 

In  view  of  (35)  and  (36),  it  can  be  showm  that  the 
minimum  error  Ex  =  0.  This  completes  the  proof  that 
the  result  of  the  averaging  process  outlined  by  (21), 
(23),  (26)  and  (27)  is  the  least  squares  solution. 

4.2  Including  Normal  Vector  Measure¬ 
ments 

Let  the  measured  values  of  ?n  unit  vectors  Uj  be  denoted 
as  \ij  where  j  =  l,2,--,m.  VVe  seek  to  find  q  and 
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such  that  they  best  satisfy  both  the  point  and  the 
direction  equations: 


for  points:  [Pi]q*  —  =  0,  z=l,2,---,n  (38) 

for  directions:  [Dj]q*  =  0,  j  =  1, 2,  •  •  • ,  m.  (39) 

In  (38),  the  matrix  [Pi\  is  given  by  (32)  and  [Dj]  = 
[y+]  —  [yT]  where  [uf],  [uj*]  are  obtained  from  (5)  by 
replacing  x  with  Uj,  respectively. 

The  least  squares  solution  to  (38)  and  (39)  is  the  pair 
q*  and  q*^  that  minimizes  the  sum: 


E  =  E”=1 


(40) 


where  a,*  and  0j  are  the  weighting  factors. 

The  same  procedure  as  the  localization  based  on 
point  measurements  is  applied  to  reduce  the  problem 
to  that  of  solving  the  following  linear  equations 


[B]q=Aq  (41) 

where  [5]  is  a  4  x  4  symmetric  matrix  given  by 

n  m 

[B]  =  2^ai{[xt][x-]  +  [xt][x:])  +  25^/3, -btliu-], 
*=i  i=i 

(42) 

and  A  is  a  scalar  given  by 


A  =  -^aff(|xfl2-|xcp+lx,|^-lxcl^>-^/3j(lujl'+iujp). 
i=i  j=i 

(43) 

The  vectors  Xc  and  Xc  defined  by 

Y  ~  QftXj  _  /44\ 

are  the  weighted  centroids  of  points  manured  relative 
to  the  world  frame  and  to  the  natural  frame,  respec¬ 
tively. 


Conclusion 

This  paper  develops  simple  algorithms  for  calibrating 
the  kinematic  relationships  among  the  CAD  models  of 
objects  based  on  a  Clifford  algebra.  This  formulation 
converts  the  kinematic  localization  problem  into  that 
of  solving  a  set  of  multivector  equations.  It  facilitates 
definition  of  a  metric  measure  of  orientation  error  com¬ 
patible  with  the  position  error.  The  geometric  and  kine¬ 
matic  structures  of  two  measured  point  systeins  are  ex¬ 
ploited  in  an  averaging  process  which  reduces  the  prob¬ 
lem  to  that  of  solving  four  linear  equations.  The  so¬ 
lution  to  the  problem  is  symmetric  to  input  data  and 
minimizes  the  squares  of  position  and  orientation  er¬ 
rors.  The  averaging  process  presented  in  this  paper  is 
directly  applicable  to  localization  problems  in  computer 
vision. 
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Computation  of  Spatial 
Displacements  from  Redundant 
Geometric  Features 

This  paper  follows  a  previous  one  on  the  computation  of  spatial  displacements  (Ravani 
and  Ge,  1993).  The  first  paper  dealt  with  the  problem  of  computing  spatial 
displacements  from  a  minimum  number  of  simple  features  of  points,  lines,  planes,  and 
their  combinations.  The  present  paper  deals  with  the  same  problem  using  a  redundant 
set  of  the  simple  geometric  features.  The  problem  for  redundant  information  is 
formulated  as  a  least  squares  problem  which  includes  all  simple  features.  A  Clifford 
algebra  is  used  to  unify  the  handling  of  various  feature  information.  An  algorithm  for 
determining  the  best  orientation  is  developed  which  involves  finding  the  eigenvector 
associated  with  the  least  eigenvalue  of  a  4  X  4  symmetric  matrix.  The  best  translation  is 
found  to  be  a  rational  cubic  function  of  the  best  orientation.  Special  cases  are  discussed 
which  yield  the  best  orientation  in  closed  form.  In  addition,  simple  algorithms  are 
provided  for  automatic  generation  of  body-fixed  coordinate  frames  from  various  feature 
information  The  results  have  applications  in  robot  and  world  model  calibration  for 
off-line  programming  and  computer  vision. 


Introduction 

A  solution  of  the  problem  of  computing  a  spatial  displace¬ 
ment  from  position  data  of  a  minimum  number  of  simple 
features  of  points,  lines,  planes,  and  their  combinations  is 
provided  by  Ravani  and  Ge  (1993).  In  practical  applications, 
however,  redundant  features  are  measured  to  filter  errors  in 
sensor  measurements  and  to  improve  reliability  of  the  mea¬ 
suring  system.  This  paper  formulates  this  case  as  a  least 
squares  problem  and  determines  the  solution  in  closed  form 
taking  advantage  of  the  geometric  structure  of  the  problem 
and  considering  different  features  of  points,  lines,  planes  and 
their  combinations. 

The  problem  of  computing  a  spatial  displacement  has 
been  studied  by  many  researchers  using  only  point  features. 
Most  researchers  used  position  data  of  three  noncoUinear 
points,  see  for  example,  Beggs  (1966),  Bottema  and  Roth 
(1979),  Laub  and  Shiflett  (1982),  and  Angeles  (1986).  A 
comparison  of  these  methods  can  be  found  in  Fenton  and  Shi 
(1990).  When  more  than  three  points  are  considered,  least 
squares  approximation  methods  are  commonly  adopted. 
Spoor  and  Veldpaus  (1980)  developed  a  least  squares  method 
for  computing  spatial  displacements  of  which  rotations  were 
represented  by  orthonormal  matrices,  the  orthonormality  of 
rotation  matrices  (or  rigidity)  were  enforced  through  the  use 
of  Lagrange  multipliers.  Although  not  explicitly  stated,  the 
best  translation  vector  was  obtained  as  the  difference  be¬ 
tween  the  centroid  of  the  coordinates  in  one  system  and  the 
displaced  centroid  of  the  coordinates  in  the  other  system. 
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Computation  of  the  best  rotation  matrix  was  more  involved 
including  the  solution  of  an  eigenvector  problem  of  a  3  x  3 
symmetric  matrix  and  matrix  multiplication.  But  the  solution 
was  of  closed-form  and  no  iteration  was  required.  Horn 
(1987)  presented  a  more  general  least  squares  method  which 
included  the  scaling  factor  (for  applications  in  photogramme- 
tiy)  and  obtained  the  same  results  for  the  best  translation  as 
Spoor  and  Veldpaus  (1980)  but  with  a  clearer  geometric 
interpretation.  He  also  showed  that  if  unit  quaternions  were 
used  to  represent  rotations,  the  best  rotation  was  the  quater¬ 
nion  obtained  as  the  eigenvector  associated  with  the  most 
positive  eigenvalue  of  a  4  x  4  symmetric  matrix.  All  the 
above  works,  however,  have  only  dealt  with  computation  of 
displacements  from  point  features  and  have  not  included 
other  simple  geometric  features  of  lines,  planes,  and  their 
combinations  with  points. 

Ravani  and  Ge  (1993)  developed  a  general  framework  for 
computing  spatial  displacements  from  position  data  of  simple 
geometric  features  which  include  not  only  points  but  also 
lines,  planes  and  their  combinations  with  points.  They  dis¬ 
cussed  issues  related  to  uniqueness  of  the  computation  and 
minimum  number  of  required  features  and  observed  that  for 
the  duality  between  points  and  planes  to  be  valid,  orienta¬ 
tions  of  these  features  need  be  considered.  They  concluded 
that  if  orientation  information  was  not  specified  than  specifi¬ 
cation  of  four  (rather  than  three)  points  features  was  neces¬ 
sary  for  computation  of  a  unique  displacement.  They  repre¬ 
sented  simple  features  of  points,  lines,  and  planes  by  multi- 
vectors  and  provided  equations  for  displacements  of  these 
features  using  a  Clifford  algebra  of  multiveaors. 

This  paper  builds  on  the  work  of  Ravani  and  Ge  (1993)  to 
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study  the  problem  of  computing  spatial  displacements  from 
redundant  position  data  of  point,  line,  and  plane  features 
and  their  combinations*  Here  orientation  is  not  an  important 
issue  since  redundant  rather  than  minimum  position  data  is 
considered.  A  least  squares  approach  is  adopted  and  all 
simple  features  are  represented  in  homogeneous  coordinates 
and  are  manipulated  as  multivectors.  The  use  of  homoge¬ 
neous  coordinates  allows  representations  of  the  features  that 
are  independent  of  the  measuring  techniques  employed.  An 
algorithm  for  determining  the  best  orientation  is  then  devel¬ 
oped  which  involves  finding  the  eigenvector  associated  with 
the  least  eigenvalue  of  a  4  x  4  symmetric  matrix.  The  best 
translation  is  found  to  be  a  rational  cubic  function  of  the  best 
orientation.  Special  cases  are  also  discussed  which  yield  the 
best  orientation  in  closed  form.  In  addition,  simple  algo¬ 
rithms  are  developed  for  automatic  generation  of  body-fixed 
coordinate  frames  from  various  feature  information. 

In  practical  applications,  the  position  data  of  all  simple 
features  of  points,  lines,  and  planes  can  be  determined,  by 
using  feature  extraction  algorithms,  from  visual  data  gener¬ 
ated  by  computer  vision  hardware.  Point  and  line  features 
can  also  be  measured  directly  by  using,  for  example,  a  touch- 
trigger  probe  and  a  theodolite,  respectively.  This  method  for 
computing  spatial  displacements  from  point  features  has 
found  a  number  of  applications  in  various  fields.  In  robotics 
and  automation,  it  can  be  used  to  estimate  the  pose  (position 
and  orientation)  of  a  workpiece  to  be  grasped  by  a  robot 
(Chen  et  al.,  1980)  and  to  determine  a  part  location  in  CAD 
model-based  flexible  manufacturing  (Gunnarsson  and  Prinz, 
1987);  in  artificial  intelligence,  it  can  be  used  for  object 
identification  and  localization  (Grimson  and  Lozano-Perez, 
1984)  and  for  rigid-body  motion  estimation  (Luo  and  Yang, 
1990);  in  biomechanicsi  it  can  be  used  for  kinematic  analysis 
of  bone  movements  (Spoor  and  Veldpaus,  1980);  and  in 
photogrammetry  it  can  be  used  to  recover  the  transformation 
between  two  coordinate  systems  (Horn,  1987).  The  results  in 
this  paper,  in  addition  to  their  theoretical  interest  in  compu¬ 
tational  geometry  of  motions,  facilitate  the  use  of  measuring 
systems  for  the  ^orementioned  applications  that  can  handle 
not  only  point  features  but  also  lines  and  plane  features. 

The  outline  of  the  paper  is  as  follows.  We  first  formulate 
the  problems  of  computing  spatial  displacements  from  points, 
lines,  and  planes,  in^vidually.  We  then  put  them  together  to 
obtain  a  formulation  that  applies  to  all  cases.  The  result  is  a 
constrained  least  squares  problem  and  a  Lagrange  multiplier 
technique  is  employed  for  the  solution.  Special  cases  are  then 
discussed  from  which  closed-form  solutions  are  obtained. 
The  last  section  presents  simple  algorithms  for  automatic 
generation  of  body-fixed  coordinate  frames. 


1  Preliminaries 

This  section  gives  a  brief  account  on  representations  of 
simple  features  of  points,  lines,  planes  and  uses  a  Clifford 
algebra  to  represent  displacements  of  these  simple  features. 

1.1  Representations  of  Simple  Features.  The  position 
data  of  a  point  feature  can  be  obtained  directly  by  using  a 
coordinate  measuring  machine;  it  can  also  be  computed  from 
the  position  data  of  line  and  plane  features,  as  the  intersec¬ 
tion  of  a  line  with  a  plane  or  die  intersection  of  three  planes. 
The  location  of  a  line  feature  can  be  determined  from  the 
position  data  of  two  distinct  points,  or  a  point  and  a  direc¬ 
tion,  or  two  planes.  The  location  of  a  plane  feature  is 
determined  by  measuring  the  normal  direction  and  the  dis¬ 
tance  from  the  origin.  It  can  also  be  determined  indirectly  by 
measuring  a  point  and  a  line  or  three  points  on  the  plane. 

In  projective  geometry,  points  and  planes  are  represented 
by  homogeneous  coordinates  and  lines  are  represented  by 
Pliicker  coordinates  so  that  the  principle  of  duality  can  be 


applied  when  manipulating  these  features.  Ravani  and  Ge 
(1993)  used  a  Clifford  algebra  of  projective  three-space  and 
associated  point,  line,  and  plane  features  with  multivectors  of 
rank  1,  2,  and  3.  In  this  way,  the  geometric  operations  such  as 
meet  and  joint  of  these  features  can  be  carried  out  alge¬ 
braically  by  using  the  wedge  product  and  dual  operation  of 
multivectors.  This  allows  for  conversion  of  various  represen¬ 
tations  of  point,  line,  and  plane  features  (due  to  different 
measuring  techniques)  to  their  respective  standard  homoge¬ 
neous  form.  This  facilitates  unified  handling  of  these  features 
that  are  independent  of  the  measuring  techniques. 

In  what  follows,  points  are  represented  by  homogeneous 
coordinates  of  the  form  (x,  1)  where  x  =  (z^  Z2,  x^)  are  the 
Cartesian  coordinates;  lines  are  represented  by  the  Pliicker 
coordinates  of  the  form  (u,  vP)  where  u  =  (u,,  U2,  u^)  is  a 
unit  vector  along  the  line  and  =  (uj,  ^2,  is  the 
moment  of  u  about  the  origin;  and  planes  are  represented  by 
homogeneous  coordinates  of  the  form  A  =  (a,  a^)  where 
a  =  (fli,  ^2?  ^3)  veaor  normal  to  the  plane  and 

is  the  distance  of  the  plane  from  the  origin. 


12  Spatial  Displacements  of  Simple  Features.  Let  P 
and  P  denote,  respectively,  two  coordinate  systems  repre¬ 
senting  two  distinct  positions  of  a  rigid  body  in  space.  The 
displacement  from  P  to  P  is  a  combination  of  a  rotation  and 
a  translation.  The  rotation  about  an  axis  5  with  an  angle  6  is 
given  by  a  set  of  four  numbers,  q  =  ^3»  ^4)  (call^^i 

the  Elder  parameters)  where: 


(e\  [6] 

^3  =  S3sinl-l,  ^4-cos|^-j, 


and  s  =  +  5262  +  is  a  unit  vector  along  5.  The 

translation  is  given  by  another  set  of  four  numbers,  q  =  (^1, 
?2.  93>  €4)  where 


and  d  -  die ^  +  ^262  +  ^363  is  the  translation  vector  from  P 
to  P.  The  pair  of  vectors  q  and  q®  satisfy  the  relation 

qV  =  qi^i  +  €2^2  +  93^3  +  949S  =  0  (2) 

and  are  called  the  Stiufy  vectors  (Bottema  and  Roth,  1979). 
Note  that  the  Euler  parameters  are  normalized  such  that 

q’^q  =  q?  +  92  +  +  ^4  =  1-  (3) 

A  special  algebra,  called  the  Clifford  algebra,  is  used  by 
Ravani  and  Ge  (1993)  to  manipulate  the  study  vectors  (q,  q^) 
and  the  coordinates  of  simple  geometric  features  of  points, 
lines,  and  planes.  Clifford  algebra  is  an  associative  algebra 
intrc^uced  by  Clifford  (1876)  for  manipulating  not  only  vec¬ 
tors  (representing  points)  but  also  multivectors  (representing 
lines  and  planes).  In  this  algebra,  q  is  represented  by  an  even 
element  (a  combination  of  a  bivector  and  a  scalar): 

q  =  (JiCjCz  +  ijCiCs  +  5362*1) (4) 

where  the  geometric  products  of  the  unit  vectors  are  defined 
as: 

emen=-6nem.  for  m#n,  ef  =  e|  =  e|  =  1. 

In  terms  of  the  geometric  product,  the  definition  (1)  becomes 
q»  =  (l/2)dq.  (5) 
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Rg.  1  A  point,  a  line,  and  a  plane  at  two  different  positions  P  and 
P 


The  “square”  of  a  vector  x  =  +  ^262  +  ^363,  x^,  is  sim 

ply  XX  =  Ixp  where  Ixl  =  (x^  +  ^2  +  is  the  Euclidean 
norm  of  x.  More  details  about  Clifford  algebra  can  be  found 
in  Hestenes  (1986)  and  McCarthy  (1990). 

At  the  position  P,  the  homogeneous  coordinates  of  a 
point,  a  line,  and  the  polar  point  of  a  plane  are  denoted  by 
lx,  1),  (u,  u^),  and  (a,  a^),  respectively;  and  at  the  position  P, 
the  homogeneous  coordinates  of  the  corresponding  features 
are  denoted  by  (x,  1),  (u,  u®),  and  (a,  a^),  respectively,  see  Fig. 
1.  The  displacements  (q,  q^)  of  these  features  from  P  and  P 
are  given  by  the  following  multivector  equations  (Ravani  and 


Ge,  1993). 

For  points  features,  we  have 

xq  -  qx  -  2q®  =  0;  (6) 

For  line  features,  we  have 

i  uq  -  qu  =  0;  (7) 

u®q  -  qu“  +  uq®  -  q°u  =  0;  (8) 

For  plane  features,  we  have 

aq  -  qa  =  0;  (9) 

^4  -  «4  -  2(qa)  •  q®  =  0.  (10) 


The  product  of  a  vector  x  =  x-^e^  +  X2e2  +  X3e3  with  the 
even  element  q  defined  by  (4)  can  be  conveniently  expressed 
in  the  matrix  form 

xq  =  lj;'"]q.  qx=[^"]q, 

where  and  [x~]  are  4X4  skew-symmetric  matrices 
given  by 


combined  together  to  form  the  sum  of  squares  of  errors  for 
all  three  types  of  features. 

2A  Sum  of  Squares  of  Errors  for  Point  Features.  Let 
the  coordinate  of  n  points  at  the  two  positions  P  and  P  be 
given  by  the  Cartesian  vectors  Xj  and  Xj,  where  f  =  1,  2,  . . . , 
n.  The  displacement  of  each  of  these  point  features  is  given 
by  the  multivector  equation  of  the  form  (6).  Due  to  measure¬ 
ment  error,  it  is  impossible  to  find  q  and  q°  such  that  (6)  is 
satisfied  for  each  point.  Instead  there  will  be  in  general  a 
residual  error: 

E;  =  x,q  -  qx,  -  2q®.  (13) 

An  overall  measure  of  the  error  as  a  function  of  q  and  q^, 
denoted  by  £^(q,  q°),  is  given  by  the  sum  of  squares  of  these 


£4(9.  q®)  =  ZErE,- 
Expand  the  above  to  obtain 

Ex(q>q®)  =  -2  E  (x,q)  •  (qx,)  -  4n(x^q  -  qx,)  •  q° 


+  4«q®  •  q®  +  E  +  E  k  '  q 


1  B  J  n 

Xc  =  -  E  =  i:  E  X/ 


are  the  centroids  of  two  systems  of  position  vectors  and  [xJ, 
lx, -I  are  the  Euclidean  norms  of  Xj,  x^,  respectively.  In  matrix 
form,  the  error  function  is  given  by 

Ex(q.q®)  =  q^[^4]q  +  q’’[5x]q°  +  4n(q®)V  (i5) 
where  [v4^]  is  a  4  X  4  symmetric  matrix  given  by 

M  =  2 E  [xt][x7]  +  ( E s.f  +  E (16) 

1-1  f=l  / 

and  [BJ  is  a  4  x  4  skew-symmetric  matrix  given  by 


[5j=-47,([x-]-[xr]).  (17) 

In  the  above,  [/]  denotes  the  4  X  4  identity  matrix;  the 
skew-symmetric  matrices  [x^],  [x^],  [xf],  and  [x~]  are 

defined  by  (11). 


The  product  of  these  two  matrices  commutes,  i.e.,  [x~][x'^] 
=  [x‘^][x“].  The  product  of  a  vector  Xr  an  even  element  q, 
and  another  vector  x  is  given  by 

xqx  =  -[x-"][x-]q,  (12) 

where  [x'*’]  is  obtained  from  [x*^]  by  replacing  x  with  x. 

2  Sums  of  Squares  of  Errors 

This  section  uses  Clifford-algebra  equations  for  displace¬ 
ments  of  points,  lines,  and  plane  features  to  obtain  the  sums 
of  squares  of  errors  for  these  features.  The  results  are  then 


22  Sum  of  Squares  of  Errors  for  line  Features.^  Let  the 
coordinates  of  m  lines  at  the  two  positions  P  and  P  be  given 
by  the  Pliicker  vectors  (u^,  u^)  and  (u^,  u^),  respectively,  where 
z  =  1,2,  m.  The  displacement  of  a  line  (u,  u°)  from  P  to 
P  is  given  by  (7)  and  (8).  Since  the  position  data  are  imper¬ 
fect,  there  will  be  errors  in  both  orientation  and  position: 

Eo/  =  -  qu, 

and 

Ep/  =  u/q  -  qu?  +  “zq°  q°Ui-  (i8) 

An  overall  measure  of  the  orientation  error  is  given  by 

m 

Eo(q)  =  E  E«;  •  E^; 

which  can  be  expanded  to  obtain 

m  I  m  m  \ 

Eo(q)  =  -2  E  («,q)  •  (q«/)  +  E  +  E  l“,P  q •  q- 

r*l  \i=l  i  =  l  / 

In  matrix  form  the  equation  above  is  given  by 

■E^o(q)  =  q^[^o]q  (i9) 
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where 


M  =  2j:[un[u-]+  Eiu,p+ Etu,n[/]  (20) 


i=l 


/=1 


is  a  4  X  4  symmetric,  matrix. 

An  overall  measure  of  the  position  error  is  given  by 

/=1 


which  can  be  expanded  and  put  into  the  following  matrix 
form 


£,(q.q“)  =  q"[^^]q  +  q^^Jq®  +  (q»)^[CjqO. 

The  coefficient  matrices  [A^],  [Q]  are  symmetric  matrices 
given  by 

[■^,1- 21;  [(“?)*][(«»■] 

i=  1 

+  (|u?p  +  Ki,P  +  |u?P  +  |u/)[/], 

[CJ  =  2£  [«-][«-]. 

and  the  matrix  [B„]  has  both  symmetric  and  skew-symmetric 
parts,  [jB^]  and  [B_],  which  are  given  by 

t«.i  -  2  |;J[(i!)*][«r)  +  [in[K)']), 

[s.)  -  -21: ([(!:?)*][«*]  +  [(»?)-][.,-]). 

1 

An  overall  measure  of  both  the  position  and  orientation 
error  is  given  by  the  following  combination: 

£.(q.q°)  =/3£o(q)+£^(q,q°) 

=  q’'[^u]q  +  q^[-BJq°  +  (q°)’'[cjq*  (21) 

where  is  a  positive  weight  and  [AJ  =  ^[AJ  +  [Apl 

23  Sum  of  Squares  of  Errors  for  Plane  Features.  Let 
A,-  =  (a,-,  a^)  and  A,.  =  (a,  d^)  (z  =  1,  2,  be  the  coordi¬ 

nate  of  k  planes  at  P  and  P,  respectively.  Similar  to  the 
measure  of  orientation  errors  of  line  features,  an  overall 
measure  of  the  orientation  errors  of  the  plane  features  is 
given  by  q’^lAjq  where  [Aj  is  obtained  from  (20)  by  replac¬ 
ing  u^,  u,-  with  a^,  a,-,  respectively. 

The  errors  in  the  locations  of  the  plane  features  are  given 
by  (10)  as 

e,-  ==  -  a^.  -  2(qa,.)  •  q®. 

The  square  of  the  errors  of  this  form  results  in  a  error 
function  that  is  quartic  in  (q,  q°),  which  is  difficult  to  handle 
for  the  least  squares  solution  method  proposed  by  this  paper. 
For  this  reason,  we  use  instead  the  points  and  lines  of 
intersection  among  the  given  planes  to  determine  the  loca¬ 
tion  errors  of  the  planes.  In  this  way,  the  sum  of  squares  of 
errors  for  plane  features  is  reduced  to  a  combination  of  the 
sums  of  squares  of  errors  for  point  and  line  features.  For  the 
rest  of  this  paper,  we  focus  our  attention  to  the  least  squares 
solution  for  computing  a  spatial  displacement  from  redun¬ 
dant  point  and  line  features. 

2.4  Sum  of  Squares  of  Errors  for  Points  and  Lines. 
Combining  the  results  in  Sections  2.1  and  2.2,  we  obtain  the 
sum  of  squares  of  errors  for  combinations  of  points  and  lines: 

£(q.q®)  =  a^E^(q,q°)  +  a„£„(q,q°) 

where  a^,  are  positive  weights.  The  substitutions  of  (15) 
and  (21)  into  the  above  yields 

£(q.q“)  =  q^[^]q  +  q^[BV  +  (q“)^[C]q'>  (22) 


where  [A]  and  [C]  are  4  X  4  ^mimetric  matrices  given  by 
[A]  =  +  a,[A,l  [C]  =  4a,n[/]  +  aJCj, 

respectively,  and  [5]  is  a  general  4x4  matrix  given  by 

It  is  clear  that  the  grand  sum  of  squares  of  errors  for 
combinations  of  points,  lines,  and  planes  can  be  put  into  the 
same  form  as  (22). 

The  problem  of  computing  spatial  displacements  from 
redundant  position  data  of  points,  lines,  planes,  and  their 
combinations  is  reduced  to  that  of  minimizing  £(q,  q°)  by 
variation  of  q  and  q°  subject  to  the  constraints  (3)  and  (2), 
i.e., 

q^q  =  1,  qV  =  0.  (23) 


3  An  Algorithm  for  the  Least  Squares  Solution 

This  section  deals  with  the  constrained  minimization  prob¬ 
lem  in  the  preceding  section  using  Lagrange  multipliers  and 
develops  a  simple  algorithm  for  solving  the  least  squares 
problem. 

3.1  Conditions  for  Minimum.  The  Lagrangian  function 
to  be  minimized  is  as  follows 

L(q.q«,  A,  A»)  =  ^(q.qO)  -  A(q’-q  -  1)  -  A»(qV). 

(24) 

The  conditions  for  minimum,  i.e.,  dL /dq  =  0  and  dL /dq^ 
=  0,  lead  to 


2[^]q  +  [B]q^  ~  2Aq  -  -  0  (25) 

and 

[£]’'q  +  2[C]q®- AOq  =  0.  (26) 

These  two  vector  equations  are  linear  in  q  and  q^.  Together 
with  the  two  quadratic  constraint  equations  q^q  =  1  and 
q^q  ==  0,  they  constitute  a  system  of  ten  equations  needed  to 
solve  the  ten  unknowns,  q,  q^.  A,  and  A®. 

To  solve  these  ten  equations,  we  first  represent  the  best 
translation  q^  in  terms  of  the  best  orientation  q.  In  view  of 
(26),  we  obtain 


q«  =  -(l/2)[C]-‘[fi]^  +  (l/2)[C]-'qA'>.  (27) 
Substitute  (27)  into  (25)  to  obtain  the  following  equation: 

4lA’]  q  -  4Aq  +  A“[5']q  -  (A»)"[C]-'q  =  0,  (28) 
where  [A']  and  [B'\zit  symmetric  matrices  given  by 


[A]  =  [A]  -  (1/4)[S][C]-*[S]^  (29) 

[F]  =  [B][C]-^  +  [C]-*[5f .  (30) 

The  Lagrange  multipliers,  A  and  A°,  in  (28)  are  obtained  as 
homogeneous  functions  of  q: 

q^[C]-^[Bfq 


a0  = 


A  = 


q"[C]  -q 

q^[^]q  q’'[£][C]-'[Brq 


(31) 


q^q 


4q’‘q 


+  A 


y[B][C]-^q 


4q’'q 


(32) 


The  substitution  of  (31)  and  (32)  into  (28)  results  in  four 
nonlinear  equations  in  the  components  of  q  and  therefore,  in 
general,  the  best  orientation  q  does  not  seem  to  exist  in 
closed  form.  The  best  translation  q®,  however,  is  a  rational 
cubic  function  of  the  best  orientation  q,  in  view  of  (27)  and 
(31). 


3.2  A  Simple  Algorithm.  We  now  develop  a  simple  al- 
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gorithm  for  computing  the  best  orientation  q.  We  first  rewrite 
(28)  in  the  following  form: 

[D(A°)]q  =  Aq  (33) 

where  [D(A°)]  is  a  symmetric  matrix  given  by 

[D(\0)]  =  [A]  +  (1/4)A«[B']  -  (l/4)(A«)"[C]-^ 

(34) 

This  indicates  that  the  best  orientation  q  corresponds  to  one 
of  the  unit  eigenvectors  of  the  4x4  symmetric  matrix 
[D(A®)],  provided  that  A^  is  known.  Since  there  are  four  real 
eigenvectors  for  the  real  symmetric  matrix  [Z)(A®)],  there  are 
four  local  optimal  orientations  for  the  least  squares  problem. 
The  global  optimal  orientation  q  is  the  eigenvector  associated 
with  the  least  eigenvalue  of  [Z>(A®)],  see  Appendix  A. 

The  eigenvalues  of  [i)(A°)]  are  the  solutions  of  the  quartic 
characteristic  equation 

det[D(A“)  -  A/]  =  0. 

The  eipnvalues  can  be  obtained  in  closed  form.  Once  the 
least  eigenvalue  A  is  selected,  the  corresponding  unit  eigen¬ 
vector  q  is  obtained  by  solving  the  homogeneous  equation 

[D(A0)  =  A/]q  =  0.  (35) 

Appendix  B  provides  a  simple  solution  method  for  (35)  using 
vector  wedge  product. 

The  solution  method  for  obtaining  the  eigenvalues  and 
eigenvectors  for  the  4  x  4  real  symmetric  matrix  (34)  is  the 
core  of  the  following  simple  algorithm  for  determining  the 
best  orientation  q.  The  algorithm  proceeds  as  follows: 

(1)  Compute  two  constant  symmetric  matrices  [A']  and 
[B'l  using  (29)  arid  (30),  respectively. 

(2)  Select  an  initial  orientation  as  represented  by  a  unit 
quaternion  q. 

(5)  Compute  the  value  of  A°(q)  using  (31)  and  then  deter¬ 
mine  the  matrix  function  [I>(A®)]  using  (34). 

(4)  Find  the  least  eigenvalue  A  and  the  associated  unit 
eigenvector  of  the  matrix  [Z)(A^)]. 

(5)  Compute  <f>  =  cos"Hq^  ■  q)  and  let  the  new  q  be 

(<1  +  <Jd)/(2cos(<;>/2)), 

(6)  If  the  norm  |^|  is  sufSciently  small,  stop;  otherwise  go 
to  step  3  and  repeat  the  procedure. 

The  issue  of  choosing  an  initial  orientation  for  the  above 
algorithm  will  be  discussed  in  the  next  section. 


4  Special  Cases 

This  section  discusses  special  cases  of  the  least  squares 
problem  when  only  point  features  or  line  features  are  consid¬ 
ered.  The  kinematic  and  geometric  insight  gained  by  studying 
these  special  cases  is  then  incorporated  into  the  choice  of  an 
initial  orientation  for  the  least  square  algorithm  developed  in 
the  preceding  section. 

4.1  Point  Features.  When  only  n  point  features  are 
considered,  the  matrices  [A],  [B],  and  [C]  become  [A^], 
and  4n[I],  respectively.  The  matrix  given  by  (17)  is 
skew-symmetric  and  therefore  the  Lagrange  multiplier  A^ 
given  by  (31)  is  no  longer  a  function  of  q  but  equals  to  zero, 
i.e.,  A^  =  0.  Thus  the  least  square  algorithm  yields  the  best 
orientations  q  in  one  step  and  no  iteration  is  required.  The 
solution  is  of  closed  form  and  is  summarized  as  follows. 

The  best  orientation  q  is  the  unit  eigenvector  associated 
with  the  least  eigenvalue  of  the  matrix  [A'^]  =  [AJ 
eigenvalue  problem  is  given  by 

[^x]q+^[Bj[Bjq  =  Aq.  (36) 


This  is  essentially  the  same  result  as  that  provided  by  Horn 
(1987). 

The  best  translation  given  by  (27)  is  reduced  to 

q«  =  (l/8n)[5jq  =  i([i^^]  -  [^;])q.  (37) 

This  can  be  recast  in  the  multivector  form  as 

q°  =  (l/2)(x,q  -  q,,)  (38) 

where  and  are  the  centroids  of  two  systems  of  position 
vectors.  This  indicates  the  best  translation  q°  is  given  by  the 
translational  offset  of  the  two  centroids,  a  well-known  result 
provided  by  Spoor  and  Veldpaus  (1980)  and  Horn  (1987). 

Note  that  (36)  can  be  written  in  the  following  multivector 
form  after  the  substitution  of  (16)  and  (17): 

m 

-2  E  (x;qx;  “  Xcq^c) 

i«l 

+  E  (R/  +  lx;l^  -  -  lxcl^)q  =  M-  (39) 

1 

This  will  be  used  later  for  comparison  with  the  result  ob¬ 
tained  from  an  averaging  process. 

4.2  An  Averaging  Process.  It  turns  out  the  closed-form 
solution  above  is  related  to  the  solution  obtained  from  an 
averaging  process.  The  residual  error  for  each  point  is  given 
by  (13),  Sum  all  these  errors  to  obtain 

n 

E  E;  =  x^q  -  qx^  -  2q“  (40) 

where  x^  and  x^  are  the  centroids  given  by  (14).  This  means 
that  the  best  translation  q°  makes  the  sum  of  errors,  jE,, 
vanishes. 

The  substitution  of  (38)  into  (13)  yields 

Ei  =  M  -  qyf  (41) 

where  vectors  y,-  and  y.  are  measured  with  respect  to  their 
respective  centroids: 

fi  =  X,  ~  x„  y,  =  X,  --  X,.  (42) 

Equations  (41)  and  (42)  lead  to 

E  (y,E,  -  E,y,)  =  f  (|y/  +  |y/)q  -  2  E  y,qy,.  (43) 

/-I  i-1  f-1 

Substituting  (42)  back  into  (43)  and  in  view  of  (14),  we 
obtain 

E  (x,E,  -  E,.x,.) 

=  E  (tx/  +  IX/l^  -  IXcl^)q  -  2  E  (Xj-qX;  =  x^qXc)- 

i=l  i*l 

(44) 

The  comparison  of  (44)  with  (39)  results  in  the  following 
interesting  result 

n 

E  (x,E,.  -  E,x,-)  =  Aq.  (45) 

In  the  ideal  case  when  all  data  are  perfect,  i.e.,  E;  =  0  for  all 
f  =  1,  2,  . . . ,  /2,  Eq.  (45)  yields  A  =  0,  Ihis  agrees  with  the 
result  (48)  in  Appendix  A 

4.2  Line  Features.  When  only  line  features  are  consid¬ 
ered,  the  Lagrange  multiplier  A®  is  a  function  of  q  since  [BJ 
is  not  skew-symmetric  and  [CJ  is  not  a  multiple  of  [/]. 
Therefore,  the  closed-form  solution  to  the  problem  of  simul¬ 
taneously  minimizing  the  orientation  and  position  errors  does 
not  seem  to  exist. 

However,  in  the  special  case  when  all  lines  are  represented 
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Table  1  The  position  data  of  six  points  and  six  fines  at  two  distinct 
configurations 


At  the  initial  configuration  j 

Points  (x,) 

Lines  (u,) 

Lines  (uj') 

(8.0.  6.0.  2.0) 

■CfrHatlPtmiBiEiaf 

(-8.3797.  0.4647.  2.06S7) 

(2.0.  11.0.  -6.0} 

(-2.4020.  -4,8W0.  2.7061) 

(4.0.  7.0.  9.0) 

(-0.1937.  -0.1549.  0.96S7) 

(4.998649.  9.493558.  2.518699) 

(4.4.  3.0.  0.0) 

(0.8899.  -0.1001.-0-4449) 

(2.0023.  21.8033.  -0.9010) 

.\i  the  displaced  configuration  [ 

(-7.3801.  14.8362.  -5.6198) 

(0.7386.  0.1980.  -0.6443) 

(-6.0525.  -7.3061.  -9.1842) 

(-8.2420.  8.3324.  -9.2195) 

(0.8591,  0.4543.  -0.2353) 

(5.4624.  -13.7875.  -6.6742) 

(-8.5317.  12.3697.  5.2240) 

(0.5169.  0.5520.  -0.6542} 

(-0.6225.  -13.6783. -12.0351) 

(-2.7462.  12.0418.  -7.7921) 

(0.7240.  0.0162.  -0.6895) 

(-6.9688.  -1.8993.  -7.3627) 

(-0.3589.  11.5897.  -8.1025) 

(0.6876.  -0.2805.  -0.6697) 

(1.2463.-1.2115.  1.7870) 

(-9.5559.  10.3030.  -4.7979) 

by  points  and  unit  vectors  rather  than  the  Pliicker  coordi¬ 
nates,  the  least  squares  solution  can  be  found  in  the  closed 
form.  In  this  case,  the  equations  for  position  errors  (18)  are 
replaced  by  point  equations  of  the  form  (13).  The  resulting 
least  squares  problem  is  essentially  the  same  as  that  of  point 
features  and  a  closed-form  solution  can  be  obtained. 

4.4  Choice  of  Initial  Orientation.  Now  turn  back  to  the 
general  least  squares  problem  of  computing  spatial  displace¬ 
ments  from  combinations  of  point,  line,  and  plane  features. 
A  good  choice  of  initial  orientation  is  the  unit  eigenvector 
associated  with  the  least  eigenvalue  of  the  matrix  [A']  given 
by  (29).  This  orientation  is  “quasi-optimal”  in  the  sense  that 
it  minimizes  the  combination  of  errors  in  point  measure¬ 
ments,  and  orientation  errors  in  line  and  plane  measure¬ 
ments. 

4.5  An  Example.  \We  now  present  an  example  problem 
of  computing  a  spatial  displacement  from  the  position  data  of 
six  points  and  six  lines  at  two  distinct  configurations.  To 
generate  the  position  data  for  these  features,  we  first  select  a 
spatial  displacement  (q,  q®)  and  the  position  data  of  the  point 
and  line  features  at  the  initial  configuration.  We  then  gener¬ 
ate  the  position  data  of  the  given  features  at  the  displaced 
configuration  using  forward  kinematics  equations.  The  rota¬ 
tional  component  of  the  spatial  displacement  is  given  by  a 
unit  quaternion  q  -  (0.466609,  0.784751,  0.190885,  0.360561); 
the  translational  component  of  the  displacement  is  given  by 
the  vector  d  =  (-10.0,  5.0,  -5.0)  which  can  be  used  to 
compute  q®  using  (5).  Tlie  position  data  of  the  point  and  line 
features  before  and  after  the  spatial  displacement  are  given 
by  Table  1). 

We  have  implemented  the  algorithm  presented  in  Section 
3.2  and  have  found  that,  for  the  position  data  given  in  Table 
1,  the  algorithm  yields  the  given  displacement  q  =  (0.4666, 
0.7847,  0.1908,  0.3605)  and  d  =  (-10.0,  5.0,  -5.0)  in  one 
step.  This  is  expected  since  all  position  data  are  perfect.  To 
see  how  well  the  algorithm  handles  imperfect  position  data, 
we  then  add  10  percent  random  error  to  the  position  data  of 
the  geometric  features  at  the  displaced  configuration  and 
find  that  the  algorithm  converges  to  a  least  squares  solution 
in  no  more  than  four  iterations.  For  the  imperfect  position 
data  shown  in  Table  2,  the  algorithm  converges  in  two 
iterations  and  the  result  is  presented  in  Table  3.  In  obtaining 
the  result,  we  used  weights  /3  =  1.0,  =  1.0,  and  =  100.0 

Adjustment  of  these  weights  allows  for  change  of  accuracy 
requirement  of  a  given  set  of  the  geometric  features. 

5  Automatic  Generation  of  Natural  Frames 

This  section  presents  simple  geometric  algorithms  for  au¬ 
tomatic  generation  of  body-fixed  coordinate  frames  (called 
natural  frames)  from  geometric  features  of  points,  lines, 
planes,  and  their  combinations.  This  is  important  in  applica¬ 
tions  such  as  world  model  calibration  for  off-line  robot  pro¬ 


Table  2  The  imperfect  position  data  of  six  points  and  six  lines  at 
the  displaced  configuration 


At  the  displaced  configuration  (with  10/t  random  error)  j 

(-7.0906.  14.6219.  -5.9716) 

tEBsamaamm 

(-7.6305.  7.6276.  -10.0261) 

(0.8607.  0.4553.  -0.2274) 

(0.4758.  0.5808.  -0.6604) 

(-0.4700.  -13.49SS.  -12.2117) 

(-9.1206.  9.6892.  -5.1000) 

(-0.6743.  0.7303.1  -0.1088) 

(14.6192.  14.9200.  9.5499) 

Table  3  Computing  a  spatial  displacement  from  imperfect  position 
data 


Iterations 

The  least  eigeax-alue 

The  best  orientation 

1 

23.078678 

(0.462546.  0.787129.  0.192708.  0.359642) 

2 

23.085560 

(0.462549.  0.787127.  0.192709.  0.359642) 

The  best  translation 

d  =  2qV'  =  (“10.061083.4.947477.  -4.9”74046) 

gramming,  since  the  computer  model  of  each  object  has  to  be 
automatically  updated  from  the  sensory  interaction  with  the 
actual  object,  see  Ravani  (1988).  Automatic  construction  of 
natural  frames  from  point  features  have  been  discussed  pre¬ 
viously  by  Angeles  (1986)  where  he  used  mass  distribution  to 
locate  a  natural  frame. 

In  what  follows,  the  orientation  of  a  natural  frame  is 
represented  by  a  set  of  three  orthonormal  unit  vector,  de¬ 
noted  by  e'l,  €2  and  €3,  and  the  origin  of  the  natural  frame  is 
represented  by  a  vector  denoted  by  p.  Tliis  section  presents 
simple  formulas  for  determining  the  origin  p  and  the  coordi¬ 
nate  axes  Cj,  63.  The  e'2  axis  is  then  given  by  62  =  63  X  e\. 

5.1  Oriented  Features.  An  oriented  line  is  given  by  the 
signed  normalized  Pliicker  coordinates  u  =  (u,  u^).  By  the 
signed  Pliicker  coordinates,  we  mean  u  and  —  (i  represent 
two  distinct  lines  having  the  same  position  in  space  but  with 
opposite  sense  of  orientation.  An  oriented  plane  is  given  by 
the  plane  vector  A  —  (a,  a^)  where  a  is  the  unit  normal  to  the 
plane  and  is  the  distance  from  the  origin  to  the  plane.  The 
two  plane  vectors  A  and  -A  represent  two  planes  with  the 
same  position  but  opposite  sense  of  orientation. 

In  what  follows,  the  notion  of  oriented  features  is  also 
applied  to  point  features  in  order  to  determine  uniquely  an 
oriented  plane  from  three  noncoUinear  points.  An  oriented 
point  is  represented  by  the  homogeneous  coordinates  of  the 
form  X  =  (x,  a)  where  x  is  the  Cartesian  vector  of  the  point 
and  a  —  1  (cr  =  —1)  denotes  the  positive  (negative)  orienta¬ 
tion  of  the  point  (Stolfi,  1989;  Ge  and  Ravani,  1993).  Simple 
formulas  for  computing  the  coordinate  axes  €3,  and  the 
origin  p  are  presented  in  Table  4. 

52  Nonoriented  Features.  For  all  the  cases  presented 
in  Table  4,  if  simple  geometric  features  are  specified  without 
information  about  the  sense  of  orientation,  an  additional 
simple  feature  may  be  needed  to  construct  a  natural  frame. 
Table  5  contains  all  possible  combinations  of  nonoriented 
features  of  points,  lines,  and  planes  required  to  specify  a 
natural  frame  completely. 

Let  us  first  consider  the  construction  of  a  natural  frame 
using  nonoriented  point  features.  In  this  case,  the  direction 
of  the  unit  vector  63  is  given  by  =  (x  x  y  H-  y  x  z  -H  z  x 
x)  where  x,  y,  and  z  denote  the  vectors  of  the  Cartesian 
coordinates  of  three  given  points  (Table  4).  The  sense  of 
direction  for  63,  however,  can  be  specified  with  additional 
information  about  the  order  of  the  three  points  (resulting  in 
oriented  points)  or  with  an  additional  point  (denoted  as  t) 
which  is  not  coplanar  with  the  three  given  points.  Using  the 
point  t,  the  sense  of  direction  for  63  is  given  by  sign  of 
(t  “  p) '  %no  where  p  =  (x  +  y  -h  z)/3. 

All  other  cases  listed  in  Table  5  can  be  reduced  to  four 
noncoplanar  points  by  obtaining  one  or  more  of  the  follow¬ 
ing:  U)  the  intersection  point  between  a  line  and  a  plane;  (2) 
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Table  4  Formulas  for  natural-frame  generation  using  oriented  fea¬ 
tures 


Features 

l  e'j  =  g/|g|,  e'l  =  h/ihl,  and  p  | 

Three  points 
X  =  (x,<7i) 

Y  =  (y,  <Ty) 

Z  =  (z,  C:) 

g 

Normal  to  the  plane 
spanned  by  the  three  points 
g  =  (^z<7j,cr,)gno  where 
gno  =  xxy-hyx2-hzxx 

h 

h  =  z  —  p 

P. 

p  =  (x  +  y  +  z)/3 

Two  lines 
u  =  (u.u'>) 
v=  (v,vO) 

g 

g  =  u  XV  is  along  the  common 
perpendicular  w  =  (w,w^ 
where  w  =  (u  x  v)/ 1  u  x  v  I 
and  w°=:(uxv°  +  u°xv)/luxvl 

h 

h  =  u 

P 

p  =  (c^  4-  Cv)/2  is  the  midpoint  of  the 
two  feet  of  perpendicular 

Cu  =  u  x  u°  -i-  [u  •  (w  X  w’^)]u 
c,.  =  V  X  v®  +  [v  •  (w  X  wO)]v 

Three  planes 
A  =  (a.  <24) 
B=(b,64) 

C  =  (c,  C4) 

g 

g  =  b 

h 

h  =  a  X  b 

Point  of  intersection 

P  =  P//[c  *  (a  X  b)]  where 
p;  =  C4{a  X  b)  -h  64 (b  X  c)  4-  04(0  x  a) 

A  point 
and  a  line 
X=(x,a,) 
u=(u,u0) 

— 

g 

Normal  to  the  plane  spanned 
by  the  point  and  the  line 
g  =  u°  —  X  X  u 

h 

h  =  u 

P 

The  feet  of  perpendicular  from  X  to  u 
p  =  u  X  4-  (u  '  x)u 

Two  points 
and  a  plane 

X  =  (x,<rj) 

Y  =  (y,  ay) 

A  =  (a,  04) 

g 

g  =  a  X  (y  -  x) 

h 

h  =  a 

i 

P 

p=(x^y)/2 

A  point  and 
two  planes 

X  -  (x,0r,) 

A  =  (a,  04) 
B=(b,64) 

g 

g  =  ax  b 

h 

h  =  a 

P 

p  =  x 

A  line  and 
a  plane 
u  =  (u,  vP) 

A  =  (a.  04) 

g 

g  =  ax  u 

h 

h  =  a 

P 

Point  of  intersection 
p  =  (a4U  -f-  a  X  u)/(a  •  u) 

Table  5  Minimum  number  of  nonoriented  features  for  generating  a 
natural  frame 
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1 

3 

2 

0 

0 

2 

0 

1 

1 

the  intersection  point  of  three  planes;  (5)  the  feet  of  perpen¬ 
dicular  from  a  point  to  a  line;  {4)  and  the  feet  of  common 
perpendicular  to  two  lines.  The  formulas  for  these  geometric 
operations  are  found  in  Table  4. 

Conclusions 

In  this  paper  we  have  presented  a  least  squares  solution  to 
the  problem  of  computing  spatial  displacements  from  a  re¬ 
dundant  set  of  simple  features  of  points,  lines,  planes,  and 
their  combinations.  With  the  aid  of  a  Clifford  igebra,  the 
best  orientation  is  obtained  by  an  iterative  procedure  which 
involves  finding  the  eigenvector  associated  with  the  least 
eigenvalue  of  a  4  X  4  real  symmetric  matrix.  The  best  trans¬ 
lation  is  a  simple  rational  cubic  function  of  the  best  orienta¬ 


tion.  It  is  also  discussed  that  the  closed-form  solution  exists 
for  the  best  orientation  when  all  features  are  represented  by 
points  and  unit  vectors.  In  addition,  simple  algorithms  are 
provided  for  automatic  generation  of  body-fixed  coordinate 
frames  from  various  feature  information.  It  is  hoped  that  the 
results  in  this  paper  will  facilitate  the  use  of  measuring 
systems  for  determining  spatial  displacements  that  can  han¬ 
dle  not  only  point  features  but  also  line  and  plane  features. 
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APPENDICES 


A:  Absolute  Minimum  of  the  Error  Function 

Let  be  the  best  orientation  and  q®  be  the  best  transla¬ 
tion  in  Clifford-algebra  form.  The  minimum  total  error  is 
obtained  by  substituting  (27)  into  (22): 

EMs)  =  -  \^l[B][Cr[B]\s 

+  (46) 
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This  can  be  rewritten  as 

E{<ls,  q?)  =  qr[^(A®)]q,  +  i?(q„  A®).  (47) 

where  [D(A®)]  is  given  by  (34)  and 

A^)  =  --“(q.)"[^']q.A^  +  (l/2)qnC]'^q,(A^)'. 

The  term  i?(q,,  A°)  vanishes  after  the  substitution  of  (30), 
(31)  and  in  view  of  the  fact 

(qy[^][C]‘^q,  =  (qy[C]-nBfq,- 

Let  Aj  be  one  of  the  eigenvalues  of  the  matrix  [Z)(A^)].  Then 
we  have  q^(D(A®)q  -  Aj  and  consequently 

£(q„q5)-A„  (48) 

This  concludes  that  all  eigenvalues  of  the  matrix  [D(A®)]  are 
non-negative  and  that  the  total  error  £(qj,  q^)  is  at  absolute 
minimum  when  A^  is  the  least  eigenvalue  of  [Z>(A^)].  In  the 
ideal  case  when  all  data  are  perfect,  the  total  error  £(q^,  q®) 
^  0  and  consequently,  A^  =  0. 

B:  Finding  the  Eigenvector 

The  four  eigenvalues  can  be  solved  in  closed  form  from 
the  quartic  characteristic  equation.  Once  the  least  eigenvalue 
A^  has  been  determined,  the  corresponding  eigenvector  is 
obtained  by  solving  the  homogeneous  equation 

[Z)']q,  =  0  (49) 


where  [D']-  [i9(A^)]  -  A J/]  is  a  singular  matrix.  Let  a,  b,  c, 
and  d  denote  the  four  row  vectors  of  the  matrix  [£)'].  Then 
Eq.  (49)  implies 

a-q,  =  0,  b-q,-0,  c  -  q,  =  0,  d*q,  =  0.  (50) 

When  the  least  eigenvalue  A^  is  distinct,  only  three  of  the 
four  equations  above  are  independent  and  the  unit  eigenvec¬ 
tor  q,  is  unique.  Although  any  three  of  these  equations  can 
be  used  to  solve  for  q^,  it  is  numerically  more  stable  for  a 
solution  to  use  all  of  the  four  equations.  We  first  obtain  the 
following  trivector 

Q  =  aAbAc4-aAcAd  +  bAcAd4‘aAbAd 

where  “  A  ”  denotes  the  vector  wedge  product.  Wedge  prod¬ 
uct  generalizes  to  higher  dimensions  the  vector  cross  prod¬ 
uct,  see  Handers  (1963).  The  coordinates  of  this  trivector, 
Q  =  (Qi,  Qiy  Qs*  QaX  can  then  be  normalized  to  yield  the 
unit  eigenvector  q^. 

When  Aj  is  a  double  eigenvalue,  only  two  of  the  four 
equations  (50)  are  independent  and  the  unit  eigenvector  q^  is 
not  unique.  In  fact  q^  traces  out  a  unit  circle.  The  plme 
containing  the  unit  circle  is  given  by 

P  =  aAb  +  aAc  +  aAd  +  bAc-i-bAd-}-cAd. 

Any  vector  in  this  plane  can  be  normalized  to  yield  a  desired 
eigenvector  q^. 

When  Aj  is  a  triple  eigenvalue,  any  unit  vector  perpendic¬ 
ular  toa  +  b  +  c-hdisa  desired  eigenvector. 
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In  this  paper  we  present  a  method  for  obtaining  first-order  decoupled  equations  of 
motion  for  multirigid  body  systems.  The  inherent  flexibility  in  choosing  generalized 
velocity  components  as  a  function  of  generalized  coordinates  is  used  to  influence  the 
structure  of  the  resulting  dynamical  equations.  Initially,  we  describe  how  a  congruency 
transformation  can  be  formed  that  represents  the  transformation  between  generalized 
velocity  components  and  generalized  coordinate  derivatives.  It  is  shown  that  the  proper 
choice  for  the  congruency  transformation  will  insure  generation  of  first-order  decoupled 
equations  of  motion  for  holonomic  systems.  In  the  case  of  nonholonomic  systems,  or 
holonomic  systems  with  unreduced  configuration  coordinates,  we  incorporate  an 
orthogonal  complement  in  conjunction  with  the  congruency  transformation.  A  pair  of 
examples  illustrate  the  results.  Finally,  we  discuss  numerical  implementation  of 
congruency  transformations  to  achieve  first-order  decoupled  equations  for  simulation 
purposes. 


Introduction 

Constrained  multirigid  body  systems  refers  to  systems  of 
interconnected  bodies  and  particles  which  are  subjected  to 
various  motion  constraints.  Such  systems  are  abundantly  rele¬ 
vant  in  engineering  for  modeling  a  wide  variety  of  mechanical 
systems.  Much  attention  has  been  focused  on  formulation 
procedures  to  yield  the  differential  equations  describing  the 
motion  of  multibody  systems  (Crandall  et  al.,  1968;  Gibbs, 
1879;  Gibbs,  1961;  Hartog,  1948;  Huston,  1990;  Kane  and 
Levinson,  1985;  Roberson  and  Schwertassek,  1988;  Scott, 
1988;  Storch  and  Gates,  1989).  In  most  cases  the  resulting 
equations  are  numerically  integrated  to  obtain  trajectories 
characterizing  the  system’s  motion.  In  addition,  the  equations 
of  motion  are  often  analyzed  directly  to  determine  the  nature 
of  the  nonlinear  behavior.  This  paper  demonstrates  a  method, 
using  Kane’s^  equations  (Kane  and  Levinson,  1985),  for  gen¬ 
erating  equations  of  motion  which  are  decoupled  in  the 
highest  derivative  terms.  We  will  refer  to  such  equations  as 
being  first-order  decoupled.  Nonlinear  differential  equations 


*It  has  been  pointed  out  (see  Desloge,  1987  and  Huston,  1987)  that 
these  equations  are  actually  a  particular  form  of  the  Gibbs-Appcll  equa¬ 
tions  (see  Gibbs,  1879  and  1961). 
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of  this  form  are  more  easily  integrated  numerically,  as  well  as 
better  fit  for  analysis.  The  procedure  of  generalized  velocity 
component  selection  based  on  a  congruency  transformation 
(Wade,  1951)  is  developed  and  used  to  achieve  first-order 
decoupled  form  of  the  equations  of  motion. 

Considerable  attention  has  been  placed  on  contending 
with  holonomic  and  nonholonomic,  linear  and  nonlinear  mo¬ 
tion  constraints  on  multirigid  body  systems  (Kamman  and 
Huston,  1984;  Kane,  1972;  Nikravesh  and  Haug,  1983;  Wang 
and  Huston,  1988;  Wehage  and  Haug,  1982;  Wampler  et  al., 
1985;  Xu  et  al,  1990).  A  common  way  to  deal  with  con¬ 
straints  is  to  impose  them  at  an  early  stage  of  the  analysis  by 
reducing  the  set  of  dependent  generalized  coordinates  to  an 
independent  one.  If  ^e  constraints  are  nonholonomic  the 
generalized  coordinate  derivatives  are  reduced  accordingly. 
However,  many  consider  it  to  be  more  effective  to  first 
perform  the  dynamical  analysis  for  the  unconstrained  system, 
and  then  reduce  the  resulting  equations  to  a  consistent  set 
with  the  constraint  equations.  For  example,  Kamman  and 
Huston  (1984),  using  line’s  formulation,  show  that  the  pro¬ 
jection  of  existing  equations  of  motion  onto  an  orthogonal 
complement  yields  the  desired  reduced  equations.  An  orthog¬ 
onal  complement,  say  C,  of  matrix  B  would  satisfy  the  equa¬ 
tion  BC  =  0.  Ben-Israel  and  Greville  (1974)  and  Lawson  and 
Hanson  (1974)  discuss  the  mathematical  significance  of  the 
orthogonal  complement.  The  use  of  the  orthogonal  comple¬ 
ment  to  impose  motion  constraints  is  illustrated  by  Hemami 
and  Weimer  (1981),  Huston  (1990),  Kamman  and  Huston 
(1984),  Wang  and  Huston  (1988),  and  Xu  et  al.  (1990). 
Hemami  and  Weimer  (1981)  use  the  orthogonal  complement 
similarly  for  contracting  equations  generated  by  the  Lagrange 
formulation.  Wampler  et  al.  (1985)  discuss  a  method  for 
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reduction  of  existing  equations  of  motion  subject  to  addi¬ 
tional  constraints  by  recombining  terms  from  the  original 
equations.  A  numerical  procedure  for  imposing  constraints 
during  integration  based  on  generalized  coordinate  partition¬ 
ing  is  presented  by  Nikravesh  and  Haug  (1983)  and  Wehage 
and  Haug  Q982).  One  common  element  of  each  of  these 
methods  is  the  application  of  the  constraints  after  conducting 
the  dynamical  analysis.  Conversely,  we  illustrate  a  method 
that  imposes  constraints  at  an  intermediate  step  of  the  kine- 
matical  analysis  to  enable  decoupling  for  nonholonomic  ^s- 
tems,  or  systems  where  the  holonomic  constraints  are  yet  to 
be  applied. 

The  following  discussion  addresses  the  use  of  generalized 
velocity  component  selection  to  achieve  first-order  decou¬ 
pling  of  multirigid  body  systems.  The  idea  of  the  congruency 
transformation  is  explained  and  utilized  here.  A  simple  ex¬ 
ample  is  conducted  to  clarify  the  procedure  for  finding  the 
congruency  transformation  for  holonomic  systems.  Next,  a 
procedure  is  discussed  for  decoupling  of  systems  with  non¬ 
holonomic  constraints,  or  holonomic  systems  with  unreduced 
configuration  coordinate  descriptions.  The  result,  using  or¬ 
thogonal  complements,  is  a  modified  nonsquare  transforma¬ 
tion  between  generalized  coordinate  time  derivatives  and  a 
reduced  set  of  generalized  velocity  components.  Decoupling 
using  the  orthogonal  complement  is  demonstrated  by  relax¬ 
ing  a  constraint  from  the  first  example.  Lastly,  we  discuss  the 
numerical  application  of  congruency  transformations. 


Levinson,  1985)  are  formed  by  letting  the  sum  of  generalized 
active  forces  and  generalized  inertia  forces  equal  zero.  These 
eauations  are  actually  a  particular  form  of  the  Gibbs-Appell 
equations  (see  Desloge,  1987;  Huston,  1987).  Kane  (1972) 
originally  referred  to  them  as  Lagrange’s  form  of  D’Alembert’s 
principle.  In  these  equations  the  generalized  active  force 
vector  can  be  expressed  as 


F  =  E  +  r;’V,.] 


(2) 


/-I 


where  f  is  the  resultant  active  force  acting  at  the  mass  center 
of  the  fth  body,  and  t,-  is  the  resultant  moment.  V  and  are 
the  partial  velocity  and  partial  angular  velocity  matrices, 
which  are  written  as 


and 


(3Xn) 


Ti  = 
(3  xn) 


<?v,. 


d(x)i 


J  ’ 


(3) 


(4) 


These  partial  velocities  are  taken  with  respect  to  the  general¬ 
ized  velocity  component  vector  u.  The  generalized  velocity 
components  are  a  linear  combination  of  the  first  time  deriva¬ 
tives  of  generalized  coordinates.  For  ease  of  substitution  this 
relation  is  expressed  as 
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First-Order  Decoupled  Equations  of  Motion.  In  the  pro¬ 
cess  of  formulating  equations  of  motion,  for  example,  using 
Kane’s  method,  the  analyst  must  choose  a  linear  combination 
of  first  time  derivatives  of  generalized  coordinates  to  define 
generalized  velocity  components.  These,  in  a  general  form, 
were  initially  introduced  by  Gibbs  (1879,  1961),  but  were 
exploited  in  more  detail  by  Kane  (see,  for  example  Kane 
(1972)  and  Kane  and  Levinson  (1985)).  Kane  and  his  cowork¬ 
ers  have  refeaed  to  these  quantities  as  ^‘generalized  speeds.” 
Some  of  the  problems  for  such  a  term  have  been  pointed  out 
by  Papastavridis  (1992),  who  considers  either  “nonholonomic 
components  of  the  velocity  vector”  or  “quasi-velocities”  to  be 
more  appropriate  terms.  Singh  and  Likins  (1985)  mention  the 
term  “derivatives  of  quasicoordinates”  as  an  alternative.  Here, 
for  lack  of  a  better  name,  we  shall  simply  call  them  generaU 
ized  velocity  components.  The  resulting  equations  of  motion, 
using  the  generalized  velocity  components,  in  matrix  nota¬ 
tion,  are  of  the  form 

Mu  =  g(q,  u)  (1) 

where  u  and  q  are  vectors  of  generalized  velocity  components 
and  generalized  coordinates,  respectively.  M  is  a  matrix  whose 
elements  are  functions  of  generalized  coordinates  and  the 
inertia  properties  of  the  system,  and  g  is  a  nonlinear  vector 
function  of  generalized  velocity  components  and  generalized 
coordinates.  The  technique  presented  here  will  generate 
equations  of  this  form  where  M  can  be  made  diagonal  by 
judicious  selection  of  generalized  velocity  components.  Non¬ 
linear  differential  equations  of  this  form  are  dramatically 
easier  to  numerically  integrate,  as  the  need  for  computing 
M"'  at  each  iteration  is  eliminated.  In  addition,  performing 
various  analyses  of  nonlinear  behavior  tests  for  stability, 
nature  of  critical  points,  chaos,  etc.  is  facilitated  if  the 
system  equations  are  written  in  state  plane  form.  A  diagonal 
M  matrix  in  Eq.  (1)  satisfies  this  requirement. 

Kane’s  Equations.  Consider  a  system  of  p  rigid  bodies 
whose  configuration  can  be  described  completely  by  the  set 

of  n  generalized  coordinates  (qi,  Si*  *  •  •  *  *> 

q  -  [^1.  ^2.  •  •  -  equations  (Kane  and 


q  -  Tu.  (5) 

We  shall  refer  to  the  matrix  T  as  the  rate  transformation 
matrix.  Since  we  are  considering  holonomic  systems  here,  T 
is  a  square  matrix  of  order  n.  The  generalized  inertia  force 
vector  is  written  as 

F*  =  -  E  r)^H,]  (6) 

i-1 

where  a,-  is  the  mass  center  acceleration  of  the  tth  body  and 
H,  is  the  time  rate  of  change  of  angular  momentum  of  body  / 
with  respect  to  the  Newtonian  reference  frame.  Finally,  n 
dynamical  equations  of  motion  are  obtained  by  letting  the 
vector  sums  from  Eqs.  (2)  and  (6)  equal  the  zero  vector. 

F  +  F*  =  0  (7) 

Equation  (7)  is  the  matrix  form  of  the  so-called  Kane’s 
equations.  It  should  be  pointed  out  that  in  this  paper,  as  with 
other  references  on  constrained  multibody  ^sterns  (see,  for 
example,  Huston,  1990),  we  shall  use  the  term  configuration 
coordinates  in  addition  to  generalized  coordinates.  In  con¬ 
strained  multibody  systems  configuration  coordinates  refer  to 
the  variables  that  uniquely  describe  a  ^stem  s  configuration, 
but  may  be  dependent  upon  one  another.  Generalized  coor¬ 
dinates  represents  reduced,  of  independent  configuration 
coordinates. 

The  Influence  of  Generalized  Velocity  Component  Selec¬ 
tion.  Since  our  goal  in  this  paper  is  to  prescribe  a  choice  of 
generalized  velocity  components  that  would  yield  decoupled 
equations  of  motion,  it  is  first  necessary  to  reveal  the  influ¬ 
ence  of  such  a  selection  on  the  resulting  dynamical  equa¬ 
tions.  In  other  words,  we  would  like  to  expose  the  location  of 
the  matrix  T,  from  Eq.  (5),  within  Kane’s  equations.  The 
acceleration  of  the  mass  center  of  the  ith  body  can  be  written 
as 

a,.  =  y.ii  +  V;U.  (8) 

We  also  know  the  angular  acceleration  of  body  i  can  be 
expressed  as 

a>.  r^ii  +  Viu.  (9) 


S 


'Q 


O 
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r>Ar:r. 


From  this  we  can  write  the  derivative  of  angular  momentum  the  coefficient  matrix  for  the  first-order  generalized  speed 
as  vector  in  Eq*  (18^)  is  diagonal. 


Hi  -  In«  +  I|fiU  +  (10) 

where  li  is  the  central  inertia  matrix  and  is  the  angular 
velocity  matrix  associated  with  the  fth  body,  and  written  in 
terms  of  body  /’s  natural  frame.  Thus,  the  generalized  inertia 
force  becomes 

F*  =  _  £  [m;V/y.u  + 
i-  1 

+  (ii) 

From  the  relation  depicted  in  Eq.  (5),  it  is  easily  shown  that 
the  partial  velocity  matrix  of  Eq.  (3)  can  be  expressed  as 

c?V:  <?r,-  ^  ^ 

—4  =  --77T  =  — tT  (12) 

TU^  dq^  dq^ 

where  is  r,  the  Euclidean  position  of  body  z’s  mass  center 

with  respect  to  the  inertial  reference  frame.  We  can  write 
this  as 

X  =  J,T  (13) 

where  is  the  partial  derivative  of  r,-  body’s  mass  center 
position  with  respect  to  the  vector  of  the  generalized  coordi¬ 
nates.  Similarly,  Eq.  (4)  is 

d(0:  do):  ^  ^ 

— T  =  ~t?T  (14) 

or 

r,  -  n,T  (15) 

where  is  the  partial  derivative  of  body  f ’s  angular  velocity 
with  respect  to  the  time  derivative  of  the  generalized  coordi¬ 
nate  vector.  Substitution  of  Eqs.  (13)  and  (15)  into  Eq.  (6) 
gives  the  following  expression  for  the  equations  of  motion: 

-  E  +  m,T^jr4(J,T)u 


+T^firi,fl,Tu  +  (a,.T)u 

=0.  (16) 

By  letting 

A= -i:[m,j7-j.  +  nyi,n,.],  (i?) 

1-1 

a  complete  set  of  equations  of  motion  can  be  expressed  as 
T’ATu  -  E  ^  (J,-T)u 


+T^flfl,-(n,T)u  + 


h  -0  (18a) 


q  =  Tu. 


Selecting  T  for  Decoupling.  Clearly,  from  the  above  re¬ 
sults,  first-order  decoupled  equations  of  motion  will  be  gen¬ 
erated  if  T^AT  is  a  diagonal  matrix.  Therefore,  we  first 
consider  how  this  transformation  can  be  influenced  by  T  to 
satisfy  this  condition.  Notice  that  if  the  eipnvectors  of  A  are 
used  as  the  columns  of  T,  a  diagonal  matrix  results  under  the 
similarity  transformation  T”^AT.  Moreover,  A  is  symmetric, 
hence,  its  eigenvectors  can  be  chosen  to  be  orthownal.  With 
orthogonal  eigenvectors  we  know  that  =  T  ,  and  the 
similarity  transformation  effectively  appears  as  the  leading 
matrix  for  the  first-order  terms  of  Eq.  (18a).  Thus,  a  suffi¬ 
cient  condition  for  obtaining  first-order  decoupled  equations 
of  motion  is  that  the  rate  transformation  matrix  T  be  com¬ 
prised  of  the  eigenvectors  of  A.  However,  it  should  be  em¬ 
phasized  that  it  is  not  a  necessary  condition  for  decoupling. 
In  fact  it  can  be  much  less  laborious  to  choose  the  elements 
of  T  to  satisfy  a  congruency  transformation  than  to  symboli¬ 
cally  determine  the  eigenvectors  of  A.  We  will  now  show  that 
decoupling  can  be  achieved  by  satisfying  an  alternative  suffi¬ 
ciency  condition.  That  is,  if  T  is  chosen  to  fulfill  a  specific 
congruency  transformation,  Eq.  (18a)  will  be  decoupled  in 
first-order  terms.  Consider  the  following  definition  (see,  for 
example,  Wade,  1951).  If  for  two  given  matrices  A  and  N 
there  exists  a  nonsingular  matrix  T  that  satisfies  the  relation 

T^AT  =  N  (19) 

then  A  and  N  are  said  to  be  congruent.  Furthermore,  we 
shall  utilize  a  theorem  (see  Wade,  1951)  stating  that  a  sym¬ 
metric  matrix  can  be  reduced  by  a  congruency  transformation 
to  a  diagonal  matrix  of  the  same  rank.  The  principal  motiva¬ 
tion  behind  employing  congruency  transformations  to  achieve 
decoupling  is  that  the  rate  transformation  matrix  can  be 
formed  directly  with  various  combinations  of  the  elements  of 
the  A  matrix.  At  this  point  we  propose  an  algorithm  for 
obtaining  the  matrix  T  that  yields  a  diagonal  matrix  under 
the  congruency  transformation  of  Eq.  (19).  This,  in  turn, 
would  be  used  in  Eq.  (186)  as  a  rate  transformation  matrix. 
Consider  the  symmetric  A  matrbc  written  as 


The  transformation  matrix  T  will  be  composed  of  m  factors, 
such  that  it  can  be  written 

T=TiT2T3...T„  (21) 

where  m  is  the  number  of  degrees-of-freedom  less  one 
(m  ^  n  -  IX  Ti  would  be  constructed  as  follows: 


1  “^12/^11 

0  1 


0 


It  is  now  clear  that  T  manifests  itself  in  the  transformation 
T^AT  in  the  first  term  of  the  left-hand  side  of  Eq.  (18a),  as 
far  as  first-order  generalized  velocity  components  are  con¬ 
cerned.  We  now  wish  to  explain  how  the  analyst  can  choose 
the  rate  transformation  matrix  T  in  Eq.  (186)  to  assure  that 


When  the  congruency  transformation 

TfATi  -  A  (23) 

is  performed,  the  resulting  A'  is  a  matrix  with  zero  elements 
in  row  one  and  column  one,  except  for  the  element  (see 
Eq.  (24)).  In  other  words,  it  will  be  block  diagonal  with  the 
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lower  right  mhy  m  submatrix  remaining  undiagonalized,  and 
is  given  here  as 


A'  =  TjATi  = 


“11 

0 

0 


0  0 
^11  ^12 
^21  ^22 

^ml  ^m2 


0 

bxm 

blm 


.  (24) 


The  same  procedure  is  performed  on  this  result,  except  T2  is 
constructed  as 


T2  = 


1  0  ..  ..  G 

0  1  “^12/^11  •  *  •  "“^Im/^ll 

0  0  1  ...  0 


0  0  ...  0  1 
The  congruency  transformation  is  performed  again: 


(25) 


(26) 


This  is  repeated  until  a  total  of  m  transformations  have  been 
completed,  and  the  A  matrix  is  converted  to  diagonal  form. 
One  additional  requirement  is  that  T  be  nonsingular.  How¬ 
ever,  this  is  assumed  since  the  final  diagonal  matrix  is  of  the 
same  rank  as  the  original  A  matrix.  Therefore,  T  must  be  of 
full  rank.  Before  discussing  the  use  of  the  orthogonal  com¬ 
plement,  an  example  is  presented  to  illustrate  decoupling 
with  the  congruency  transformation. 


Example  of  Decoupling  With  the  Congruency  Transforma¬ 
tion.  Consider  the  following  system  of  two  particles  con¬ 
nected  by  a  rigid,  massless  rod.  Sliding  mass  mj  is  con¬ 
strained  to  move  along  the  horizontal  axis  n2,  and  the  mass 
m2  must  stay  on  the  constant  radius  arc  with  respect  to  in 
the  Uj  —  02  plane,  as  shown  in  Fig.  1. 

Using  the  generalized  coordinates  and  ^2>  matrix  A 
is  assembled  using  Jj  and  J2  for  each  of  the  masses.  This 
gives 

2]  p’) 

and 


(28) 


which  can  be  combined  according  to  Eq.  (17)  to  obtain 


—mi  —  m2  —m2rcq2 
—m2rcq2  —m2r^ 


(29) 


Equation  (19)  is  used  to  form  the  congruency  transformation 
for  A  as 

T  =  [  ^  -«2rc?2/("*i  +  '"z) 

[0  1 


For  this  system  the  resultant  applied  force  vectors  for  each 
mass  are 


and 


fi  = 


0 


(31) 

(32) 


The  above  matrices  can  then  be  assembled  using  Eq.  (18a)  to 
yield  the  following  dynamical  equations: 

-m,  —  OT,  0 

u 

0  m2r^  -  mlrV92/(m,  -  mj) 


0 

-m2r^cq2^92“2/('”i  "*2) 


0 

-m2grsq2 


■  (33) 


The  complete  set  of  equations  of  motion  are  Eq.  (33)  arid  the 
following  equation  using  the  rate  transformation  matrix  given 
in  Eq.  (30): 


1  -m2rcq2/(mf+ nt2) 

.0  1 


(34) 


Equation  (34)  .  shows  that  the  generalized  velocity  compo¬ 
nents  obtained  using  the  congruency  transformation  include 
mass  terms.  This  is  slightly  different  from  the  definition  of 
the  relationship  between  u  and  q,  as  defined  by  Kane  (1985) 
to  be  a  function  of  q  and  time,  and  not  of  the  mass  properties 
of  the  system.  It  also  indicates  another  problem  with  the 
term  “generalized  speeds,”  or  even  “generalized  velocity 
components,”  for  the  elements  of  the  vector  u,  since  they 
depend  on  mass  properties  in  the  more  general  context  used 
in  this  paper.  We  shall,  however,  continue  using  “generalized 
velocity  components”  since  the  term  “generalized”  may  be 
interpreted  that  u  is  not  just  like  a  common  velocity  vector. 


Dynamics  of  Nonholonomic  Multibody  Systems 

The  equations  of  motion  given  in  Eqs.  (18fl)  and  (186) 
become  invalid  if  the  multibody  system  includes  constraints 
which  are  nonholonomic.  If  this  is  the  case,  the  constraints 
can  be  applied  to  the  system  with  an  orthogonal  complement, 
and  the  transformation  between  generalized  coordinate 
derivatives  and  generalized  velocity  components  is  no  longer 
one-to-one.  Additionally,  situations  might  arise  where  a 
mode!  is  given  with  unreduced  configuration  coordinates 
along  with  holonomic  constraint  equations  that  are  yet  to  be 
applied.  Again,  the  orthogonal  complement  is  used  to  impose 
such  constraints.  These  ideas  are  discussed  next,  and  illus¬ 
trated  with  an  example. 

The  Orthogonal  Complement.  Huston  (1990)  shows  that 
if  the  motion  constraint  equations,  either  holonomic  or  non- 
holonomic,  are  written  as 

Bq  =  0  (35) 

and  the  unconstrained  equations  of  motion  are  as  depicted  in 
Eq.  (1),  a  valid  set  of  reduced  equations  describing  the 
dynamics  of  the  constrained  system  is 

C^Mq  =  C^g(q,  q)  (36) 

where  C  is  the  orthogonal  complement  of  B.  Recall  C  is  the 
orthogonal  complement  of  B  if  BC  =  0  is  satisfied.  C  can  be 
thought  of  as  a  non-square  transformation  between  the  unre¬ 
duced  generalized  coordinate  derivative  vector  q,  and  a  re¬ 
duced  generalized  coordinate  derivative  vector,  say  z: 

q  =  Cz.  (37) 
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Let  us  rewrite  Eq.  (36)  in  similar  form  to  Eq.  (16)  by  first 
writing 

-  E  (J,)q 

+c’'a[i,n,.q  +  c^aji,y(a,)q 


+  C’'firw^.I,a,,.  -  C^jTf.  -  -  0.  (38) 

We  can  substitute  the  expression  for  q  given  in  Eq.  (37)  into 
Eq.  (38)  to  establish  equations  of  motion  in  terms  of  the  new 
reduced  generalized  coordinate  vector  z  and  its  derivatives: 

-  E  [m,C^j/-J,Cz  +  m,C^3Tj(JiC)z  +  C^fifl.n,Cz 

i- 1  L  ^ 


+C^ftJ’l,.-(n.C)z  +  C’’nfWiI,.a., 
at 


=  0.  (39) 

One  can  see  that  the  role  of  the  matrix  C  in  Eq.  (39)  is 
equivalent  to  that  of  T  in  Eq.  (16).  Hence,  we  can  view 


Fig.  2  A  sliding  pendulum  with  unreduced  configuration  coordi¬ 
nates 


in  -  m)  independent  eigenvectors  associated  with  the  zero 
eigenvalues.  If  the  columns  of  C  are  comprised  of  these 
eigenvectors  we  have 

B^BC  -  0.  (45) 


Premultiplying  by  gives 


C^B^BC  =  0 


BC-0. 


as  special  partial  velocity  matrices.  Generalized  velocity  com¬ 
ponents  can  be  introduced  easily  using  the  transformation 

2  =  Tu.  (42) 

With  this,  the  transformation  between  unreduced  general¬ 
ized  coordinate  derivatives  and  reduced  generalized  velocity 
components  is  written  as 

q-CTu.  (43) 

A  new  set  of  equations  of  motion  in  terms  of  reduced 
generalized  velocity  components  can  now  be  given  as 

-  £  m,.T^C^jfj,Cru  +  m,.T^C^j/'^(J,CT)u 
i-lL 

+T^C’"nfl,-fl,.CTu  +  T^C’"nfl,^(fi,.CT)u 

=  0  (44) 

which  are  accurate  in  describing  the  behavior  of  the  con¬ 
strained  system.  Note  that  the  matrix  product  CT  shown  in 
Eq.  (43)  represents  a  modified,  nonsquare,  rate  transforma¬ 
tion  matrix. 

There  are  several  procedures  for  determining  the  orthogo¬ 
nal  complement  matrix  as  demonstrated  by  Huston  (1990) 
and  Hemami  and  Weimer  (1981).  Here,  we  will  consider  the 
zero-eigenvalue  approach  discussed  by  Huston.  If  there  are 
m  motion  constraints  imposed  on  a  system  that  has  n  de- 
grees-of-freedom,  the  mby  n  matrix  B  from  Eq.  (35)  will  be 
of  rank  m.  B  premultiplied  by  its  transpose  will  be  a  symmet¬ 
ric  n  by  n  matrix  also  of  rank  m.  Hence,  B^B  will  have 


Therefore,  C  is  the  orthogonal  complement  of  B.  With  C 
specified,  we  can  now  focus  on  the  coefficient  matrix  for 
generalized  velocity  component  derivatives  from  Eq.  (44)  by 
grouping  all  other  terms  into  the  function  h.  This  gives 

T^C^ACTii  =  h(q,u).  (48) 

The  matrbc  A  is  the  same  as  defined  in  Eq.  (17).  To  obtain 
first-order  decoupled  equations  for  the  constrained  system 
we  simply  choose  T  to  be  the  proper  congruency  transforma¬ 
tion,  this  time  for  C^AC,  using  the  procedure  outlined  ear¬ 
lier. 

Example:  Unreduced  Configuration  Coordinates.  We 
will  now  repeat  the  example  carried  out  above,  but  now  using 
the  dependent  configuration  coordinates  shown  in  Fig.  2. 

The  constraint  imposed  by  the  rigid  rod  is  temporarily 
removed.  The  position  Jacobians  for  the  two  masses  now 
appear  as 


j2-[5 


The  resulting  A  matrix  is 


-m,  — 


Now  we  must  find  the  orthogonal  complement  to  reimpose 
the  constraint.  Therefore,  the  constraint  representation 
should  be  put  in  the  form  of  Eq.  (35).  For  the  simple 
pendulum  this  constraint  is  written  as 

Bq  ==  [0  q2  qalq  =  0- 

To  find  an  orthogonal  complement  the  zero  eigenvectors  of 
B^B  must  be  found.  However,  for  this  problem  the  vectors 
comprising  C  can  be  obtained  even  more  simply,  using  the 
Gram-Schmidt  process,  for  example.  The  orthogonal  comple¬ 
ment  is  found  to  be 
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We  can  form  C^AC  to  allow  determination  of  T  as 


1  +  "*2) 

0  1 


(54) 


The  necessary  components  are  now  available  to  produce  the 
constrained  dynamical  equations  of  motion  using  Eq.  (44). 
The  applied  force  vectors  remain  the  same  as  in  the  previous 
example.  The  dynamical  equations  in  terms  of  the  reduced 
generalized  velocity  component  vector  are 

—nil  —  m2  0 

0  ^2) 


0 

-"»2'?392“i/('”l  +  "*2) 


0 

^28^3 


(55) 


The  additional  equations  of  motion  stemming  from  the  trans¬ 
formation  between  configuration  coordinates  and  reduced 
generalized  velocity  components  are 


q  = 


1 

0 


-"»292/("»1  +  "*2) 
1 


(56) 


Numerical  Implementation  of  Congruency  Transfor- 
mations 

Very  often  one  must  analyze  large-scale  systems  where  it 
would  be  very  tedious  to  symbolically  obtain  the  proper 
congruency  transformation.  For  this  reason,  we  now  discuss 
numerical  implementation  of  the  congruency  transformation 
so  that  it  may  be  used  on  more  complicated  systems.  Recall 
the  expression  for  the  equations  of  motion  shown  in  Eq.  (16), 
except  here  we  do  not  require  the  matrix  T  to  be  a  diagonal¬ 
izing  congruency  transformation.  In  this  case  T  may  be  any 
matrbc  that  yields  a  valid  vector  of  generalized  velocity  com¬ 
ponents.  The  equations  of  motion  for  a  holonomic  system  are 

p 

-E 

/-I 

d 

-j-T^nfL-n.-Tu  + 

dt 


m,.T’'jrj,Tu  +  (J,T)u 

dt 


=  0.  (57) 


Notice  that  the  derivative  of  the  rate  transformation  matrix  T 
appears  in  two  of  the  terms  in  Eq.  (57).  Even  though  it  would 
be  possible  to  select  T  to  be  a  diagonalizing  congruency 
transformation  at  each  iteration  of  the  numerical  integration, 
we  would  be  left  with  the  cumbersome  task  of  specifying  its 
time  derivative  as  well.  Hence,  the  following  development 
allows  selection  of  a  matrix  which  is  independent  of  the 
original  dynamical  equations  (Eq,  (57)),  and  does  not  appear 
in  derivative  form.  However,  one  must  still  choose  a  valid 
linear  combination  of  generalized  coordinate  derivatives 
(prescribed  by  T)  to  define  generalized  velocity  components, 
as  usually  done  with  conventional  application  of  Kane's  equa¬ 
tions.  For  example,  it  is  common  to  choose  a  trivial  set  of 
generalized  velocity  components  by  selecting  T  as  the  identity 
•matrix.  Denote  the  matrix  products  premultiplying  u  from 
Eq.  (57)  as 

A  =  T^l  -  E  +  nri,fi.]  J  (58) 


and  the  remaining  vector  sum  as  h(q,  u)  so  the  equations  of 
motion  can  be  written  as 

Au  =  h(q,  u).  (59) 

Let  us  introduce  the  vector  x  in  the  following  expression; 

u  =  q^x.  (60) 

Substituting  this  into  Eq.  (59)  yields 

A^x  =  h(q,u).  (61) 

We  can  now  premultiply  both  sides  of  Eq.  (61)  by  giving 
us 

q^^Aq^x  =  ‘Sk^h(q,  u).  (62) 

However,  notice  the  left-hand  side  of  Eq.  (62)  is  a  congru¬ 
ency  transformation  on  the  symmetric  matrix  A.  If  is 

chosen  as  prescribed  earlier,  will  be  diagonal.  Let 

L  =  q^^Aq^  so  Eq.  (62)  becomes 

Lx  =  q^^h(q,  u).  (63) 

Vector  X  can  be  written  as 

X  =  L"^^^h(q,  u)  (64) 

when  the  inverse  of  L  is  simply  comprised  of  the  reciprocals 
of  its  diagonal  elements.  Finally,  the  resulting  first-order 
decoupled  form  in  terms  of  u  is 

u  =  q^L-^q^^h(q,u).  (65) 

We  see  that  Eq.  (65)  is  well  structured  for  direct  numerical 
integration  without  inversion  of  the  A  matrbc. 

Conclusion 

We  have  proposed  and  demonstrated  a  method  for  creat¬ 
ing  rigid-body  equations  of  motion  that  are  decoupled  in 
first-order  terms.  This  is  achieved  by  properly  choosing  a 
congruency  transformation  that  specifies  generalized  velocity 
components.  For  nonholonomic  systems,  or  holonomic  sys¬ 
tems  with  unreduced  configuration  coordinates,  the  congru¬ 
ency  transformation  is  used  in  conjunction  with  an  orthogo¬ 
nal  complement  to  the  constraint  array.  In  both  cases,  the 
resulting  equations  are  in  a  form  that  make  it  convenient  for 
nonlinear  behavior  analyses.  Moreover,  it  becomes  an  easy 
matter  to  implement  general  integration  routines  for  first- 
order  differential  equations  to  obtain  generalized’ coordinate 
trajectories  for  simulation  purposes. 
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ABSTRACT 

In  this  paper  we  present  a  method  for  dynamically  interpo¬ 
lating  the  motion  of  a  rigid  body  in  space  between  known 
end-positions.  We  begin  by  creating  an  image  space  repre¬ 
sentation  of  the  equations  of  motion  for  a  rigid  body.  These 
equations  are  written  as  a  system  of  first-order  state  equa¬ 
tions  whose  trajectory  is  optimized  based  on  the  minimiza¬ 
tion  of  a  certaiin  performance  index.  This  leads  to  a  set  of 
boundary-value  equations  between  the  end  positions,  which 
when  solved  give  the  interpolated  motion.  Finally,  a  pro¬ 
gram  developed  to  perform  such  interpolations  numerically 
is  utilized  by  generating  a  trajectory  between  given  terminal 
states. 

INTRODUCTION 

Within  the  field  of  computer  graphics,  especially  as  it  ap¬ 
plies  to  mechanical  system  simulation  and  computer  anima¬ 
tion  the  problem  of  finding  a  mathematical  representation 
of  a  rigid  body  motion  that  approximates  a  sequence  of  ar¬ 
bitrary  displacements  is  commonly  encountered.  In  other 
words  the  end-positions  of  a  rigid  body  spatial  trajectory 
are  known,  but  the  trajectory  itself  is  unknown.  Through¬ 
out  this  paper,  the  term  ‘position’  refers  to  the  position  of  a 
point  on  a  rigid  body,  as  well  as  the  orientation  of  the  body. 
The  problem  of  determining  the  intermediate  positions  has 
been  referred  to  as  key  framing,  or  geometric  design  of  mo¬ 
tion  interpolants  (see  Ravani  and  Ge  (1994)).  It  has  ap¬ 
plications  in  computer  graphics,  computer  vision,  robotics, 
bio-mechanics,  other  fields.  Figure  1  illustrates  the  inter¬ 
polated  trajectory  of  a  rigid  body.  Many  researchers  have 
explored  methods  to  interpolate  the  unknown  motion  of  the 
body.  Traditionally,  the  problem  has  been  separated  into  in¬ 
terpolations  of  translations  and  rotations.  Dealing  with  the 
translation  is  straight  forward,  however  the  rotation  inter¬ 
polation  poses  considerable  difficulties.  Initially,  attempts 
were  made  at  individual  interpolation  of  the  Euler  angles, 


Figure  1:  Dynamically  Interpolated  Motion 

but  the  resulting  motions  seemed  to  speed  up  or  slow  down. 
More  recently,  Shoemake  (1985)  and  Pletickx  (1989)  investi¬ 
gated  subdivision  methods  using  quaternions  to  interpolate 
rotations.  Ravani  and  Ge  (1994)  transform  the  motion  in¬ 
terpolation  problem  into  a  point  interpolation  in  the  image 
space,  and  extend  the  results  of  Shoemake  by  properly  con¬ 
sidering  the  geometry  of  the  underlying  space  of  three  di¬ 
mensional  displacements.  Needless  to  say,  the  common  trait 
of  these  developments  is  their  kinematic  foundation.  That 
is,  these  methods  have  provided  ways  to  create  the  unknown 
trajectory  without  regard  to  the  dynamics  of  the  motion  of 
the  rigid  body  being  animated.  The  motivation  behind  the 
developments  in  this  paper  is  to  dynamica/Zyinterpolate  the 
motion  of  the  body  between  the  given  end-positions  for  ani¬ 
mation  and  other  purposes.  Thus  the  result  is  a  dynamically 
consistent  specification  of  the  pathway  followed  by  the  body, 
as  well  as  its  velocity  distribution. 

To  solve  the  problem  of  dynamically- based  interpolation, 
an  image  space  representation  of  the  equations  of  motion  for 
a  single  rigid  body  is  first  created.  These  equations  become 
the  system  equations  for  a  trajectory  optimization  based  on 
the  minimization  of  a  certain  performance  index.  The  solu¬ 
tion  of  the  optimization  problem  describes  the  pathway  of  an 
image  point,  and  the  velocity  of  that  point  along  the  path. 
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This  can  then  be  resolved  into  finite  and  infinitesimal  rigid 
body  displacement  information  specifying  the  reconstructed 
motion.  and 


(1) 


MOTION  REPRESENTATION 

In  this  section  we  convert  all  kinematical  entities  such  as 
Euclidean  point  position  and  velocity,  rigid  body  orienta¬ 
tion  parameters,  and  angular  velocity  of  the  body  to  an 
image  space  representation.  The  new  kinematiccJ  descrip¬ 
tion  is  used  to  generate  a  unique  set  of  equations  of  motion 
in  terms  of  an  image  space  state  vector.  We  see  that  the 
image  space  allows  us  to  treat  the  position  and  orientation 
of  a  rigid  body  in  space  as  a  single  point  in  a  dual  four- 
dimensional  space.  More  importantly,  the  time  derivative 
of  this  image  space  point  position  gives  us  an  unambiguous 
representation  of  the  rate  of  change  in  orientation  of  the 
body.  Hence,  we  can  introduce  a  sixteenth  order  state  vec¬ 
tor  that  contains  the  orientation  and  time  rate  of  change 
of  orientation  data.  In  an  essence,  the  image  space  helps 
us  circumvent  the  incomplete  analogy  between  velocity  and 
angular  velocity.  By  ‘‘incomplete  analogy”  we  mean  that 
in  the  Euclidean  space  of  three  dimensions  velocity  can  be 
found  from  the  derivative  of  a  position  vector,  but  angular 
velocity,  in  general,  can  not  be  determined  from  the  deriva¬ 
tive  of  an  orientation  vector.  With  the  mapping  to  the  image 
space,  position  and  orientation  are  captured  as  one  entity  - 
the  image  space  position.  The  derivative  of  this  point  gives 
us  the  velocity  and  angular  velocity  of  the  body.  The  image 
space  equations  of  motion  then  become  the  first  order  state 
space  system  whose  trajectory  is  optimized  based  on  effort 
minimization. 

Newton— Euler  Equations  in  the  Image  Space 

Our  goal  in  this  section  is  to  map  the  equations  describing 
the  dynamical  motion  of  a  rigid  body  to  the  image  space. 
The  kinematic  advantages  of  the  image  space  axe  illustrated 
by  Ravani  and  Ge  (1994),  and  here  we  will  demonstrate  vari¬ 
ous  dynamic  benefits  of  this  representation.  Using  the  image 
space  representation,  the  components  of  a  point  position  in 
the  image  space  become  the  configuration  coordinates,  as 
opposed  to  the  Euclidean  position  and  angular  orientation 
coordinates  ordinarily  used.  The  final  results  of  this  section 
are  the  equations  of  motion  for  a  single  rigid  body  in  state 
space  form.  They  will  be  first-order  nonlinear  equations  in 
terras  of  a  state  vector  z  (to  be  defined  later).  The  com¬ 
ponents  of  this  vector  completely  encompass  the  rigid  body 
displacement  and  time  rate  of  change  of  displacement  infor¬ 
mation.  Here,  we  begin  with  the  well-known  Newton- Euler 
equations  of  motion  for  a  rigid  body,  and  transform  them 
to  a  system  of  sixteen  first  order  equations. 

The  Newton-Euler  equations  for  a  rigid  body  with  a  body 
fixed  coordinate  system  aligned  in  the  principal  directions 
and  located  at  the  center  of  mass  are  written  as 

fi  =  mdi 

/2  =  md2 


Tl  =  IiWi  —  {I2  — 

T2  =  l2(^2  —  {h  —  /i)a;3a;i 

n  =  —  {h  —  l2)u^iu?2  (2) 

where  fi  and  t,*  (t  =  1,2,3)  are  the  applied  moments  and 
forces  in  the  ei,  62  and  63  directions  of  a  body-fixed  co¬ 
ordinate  frame.  Ii  ,  I2  and  J3  are  the  principal  centroidal 
moments  of  inertia  and  a;,  (i  =  1, 2, 3)  are  the  angular  veloc¬ 
ity  components  expressed  in  terms  of  the  body-fixed  frame. 
We  can  now  convert  all  kinematical  entities  to  the  image 
space.  If  the  image  space  point  x  is  written  in  quaternion 
form  as 

x  =  xii^X2}  +  xsk  -f  X4  (3) 

the  dual  velocity  distribution  of  the  body  can  be  expressed 
as  the  following  quaternion  product: 

V  ==  2:kx*  (4) 

where  x*  is  the  conjugate  of  the  dual  quaternion.  This  dual 
velocity  entity  can  be  thought  of  as  a  dual  number  combi¬ 
nation  of  angular  and  linear  velocity  and  written  as 


V  = 


(5) 


where  a;  is  the  angular  velocity  of  the  body  and  d  is  the 
velocity  vector  of  the  origin  of  the  body-fixed  frame.  The 
dual  quaternion  product  can  be  expanded  in  terms  of  the 
dual  Euler  parameters  and  their  derivatives,  and  written  in 
matrix  form  as 

V  =  2Dx  (6) 

where  the  dual  matrix  D  is  written  as 


D  =  D+€D°.  (7) 

The  real  part  of  D  is 
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and  the  dual  part  is 
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The  real  and  dual  parts  of  the  first  three  components  of  V 
comprise,  respectively,  the  angular  velocity  and  the  velocity 
of  the  origin  of  the  body’s  fixed  frame,  as  stated  in  Equation 
5.  The  matrix  product  of  Equation  6  can  be  separated  into 
real  and  dual  parts  as 


V  =  2Dx  +  2e(Dx° -hD^x).  (10) 
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Therefore,  using  this  relation  and  Equation  5,  the  angular 
and  linear  velocity  terms  become 

[“1=2D.  (11) 


=  2(r)x°  +  D°x).  (12) 

From  these  we  can  obtain  the  necessary  derivatives  and 
products  a?i,  0^2  and  d;3,  di,  ^2  and  rfa,  and  ^20^3, 
and  wia;2  in  terms  of  image  space  components  to  be  substi¬ 
tuted  into  Equations  1  and  2.  However,  before  doing  this, 
it  is  helpful  to  define  a  sixteen  element  state  vector  z  as 

=  26{Dx°+D®x).  (13) 


We  also  define 

Za  =  [  22  .  Zg  (14) 

and 

Z6  =  [  29  zio  .  zie  j  .  (15) 

Z6  can  be  thought  of  as  the  position  of  a  point  in  an  eight¬ 
dimensional  vector  space.  This  position  corresponds  to  a 
homogeneous  representation  of  the  position  and  orientation 
of  the  rigid  body  in  three  dimensions.  Analogously,  Za  is 
the  velocity  of  that  point  (note  Za  =  Zb),  and  defines  the 
time  rate  of  change  of  position  and  orientation  of  the  body. 

Equation  11  is  expanded  and  elements  of  state  vector  z 
are  substituted  to  get  the  components  of  the  angular  velocity 
vector  as 


d 

0 


ujx  =  2{—z\z^  -h  232^10  —  22^^11  +  ZXZ12) 

W2  =  2(— Z3^9  “  24-2^10  +  *2^1  <511  +  ^25^12) 
iJZ  =  2(z2Z^  —  ZlZlO  —  2^42^11  +  ZzZi2)-  (16) 

These  expressions  are  time  differentiated  to  yield,  after  some 
simplification, 

LJi  =  2(“-Z429  4*  is^lO  —  22^11  +  212^12) 

Cj2  =  2{—zzZq  —  z^zio  +  iizii  4-  ^22^12) 
cha  =  2(i2Z9  —  zizio  —  24^^11  +  23212)-  (17) 

In  a  similar  manner  we  obtain  the  velocity  and  acceleration 
components  of  the  origin  of  the  body-fixed  frame: 

di  =  2(  — Z4213  4“  23214  —  22215  4*  2i2i6  ”” 

28  29  4-  27210  “  26  211  -H  25212) 
d2  =  2(~232i3  —  24214  +  2i2i5  +  Z2Zi$  — 

2729  —  28210  +  25211  +  25212) 
dz  =  2(z22i3  —  2i2i4  —  24215  -|-  ZgZie  + 

2629  -  25210  -  28211  -j-  27212)  (18) 


and,  upon  time  differentiation, 

il  =  2(~24  2i3  4“  23214  “  22  215  -j-  2i2i6  ~ 

2829  4-  27210  —  26211  +  25212) 

d2  =  2(-232i3  -  24214  4“  21215  +  Z22i6  - 
2729  —  28210  4  25211  4-  25212) 
dg  =  2(22213  -  21214  -  24215  +  23  216  4- 

2529  —  25210  —  28211.+  27212).  (19) 

Finally,  we  denote  the  angular  velocity  product  quantities 
as: 


hi2  —  c<;ia;2 

^23  = 

hiz  “  wiu;3  (^9) 

The  above  quantities  are  substituted  into  Equations  1  and 
2,  and  the  six  Newton-Euler  equations  become 

til  =  2m(— 24213  +  23214  —  22215  +  2i2i6  “ 

2829  +  27210  —  25211  +  25212) 

U2  =  2m(  — 23213  —  24214  +  21215  +  22216 
2729  —  28210  +  25211  +  25212) 

113  =  2m(222i3  —  2i2i4  —  24  215  +  23216  4 

2629  -  25210  -  28211  +  27212)  (21) 

and 

ii4  =  2ii(— 2429  +  23210  —  22211  4  21212)  “ 

(J2  —  h)h23 

Us  =  2l2{—ZzZ9  —  24Z10  4  2i2ii  4  22212)  ““ 

(h-Ii)hzi 

ue  =  2/3(2229  —  21Z10  —  24211  4  23212)  — 

(Ji  -  I2)hi2  (22) 

where  u  is  simply  a  vector  of  applied  forces  and  torques, 
and  is  written  here  as 

u  =  [  /i  /2  h  n  T2  rs  ]  ^ .  (23) 

Since  the  equations  of  motion  are  now  written  in  terms  of 
eight  configuration  coordinates,  we  need  an  additional  two 
equations  to  have  a  complete  set.  These  can  be  generated 
from  the  image  space  constraints  given  in  (Ravani  and  Ge, 
1994).  We  rewrite  them  here  in  terms  of  the  state  vector  z 
as 

29  +  ^10  ^11  “^12  =  i  (2^) 

and 

29213  4  2io214  4  2ii2i5  4  2i2  2i6  =  0.  (25) 

If  we  time  differentiate  each  of  these  equations  twice  we 
obtain  the  two  additional  state  equations  as 

2i2i3  4  2529  4  22  214  4  ZgZio  4 
23215  4  27211  4  24216  4  28212  4 
2(2125  4  2226  4  2327  4  2423)  =  0  (26) 
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21^13  +  2539  +  32214  +  36310  +  Zl  +  zj  +  Z3  +  =  0.  (27) 

Equations  21,  22,  26  and  27  can  be  ordered,  and  put  into 
matrix  format,  yielding 

Hza  =  g  (28) 


Uif2li  -f  (h  —  iz)h2zl2Ii 
165/2/2  +  (/a  —  /i)/i3i/2/2 

U6/2/3  +  (/i  —  h)hi2l2h 

_  5.2  _  ^2  _  2 
ff  =  ^1  2:2—23—2:4 

^  wi/2m 

U2l2m 
uzf2m 

_  — 2(zi  Zs  H-  Z22:6  +  Z32^7  “f  ^4  2^$) 

The  coefficient  matrix  H  is  paxtitioned  as 


C  0 
A  B 


Zl5 

-.^14 

213 

—  •215 

-2^16 

213 

214 

-Zl3 

.—  216 

215 

2l3 

^14 

215 

216 

Zl2 

-Zll 

2l0 

-Zg 

2^11 

2^12 

-zg 

—  2i0 

—  •2^10 

^9 

212 

-211 

Z9 

2:10 

211 

2i2 

C  =  B,  (33) 

The  advantage  of  this  particular  ordering  becomes  clear 
when  we  consider  one  of  the  Schur  formulas  for  the  inverse 
of  a  partitioned  matrix  (see,  for  example,  Brewer  (Brewer, 
1990)).  This  formula  states  that  the  matrix  H  partitioned 
as  shown  in  Equation  30  has  the  inverse 


-B-^AC-^ 


Symbolic  inversion  of  H  becomes  even  easier  when  we  utilize 
the  fact  that  B  is  an  orthogonal  matrix.  It  is  a  simple  matter 
to  verify  the  orthogonality  of  B  by  realizing  it  satisfies  the 
following  two  conditions:  the  inner  products  between  each  of 
the  column  vectors,  or  between  each  row  vectors  comprising 
B  are  zero,  and  each  row  or  column  vector  is  of  unit  length. 
The  first  condition  can  be  visually  verified  and  the  second 
follows  from  Equation  24.  Therefore  B~^  =  B^,  and 
can  be  written  as 


B^ 

-B^AB^ 


Hence,  we  can  rewrite  Equation  28  as 


keeping  in  mind  that  numerical  inversion  of  H  is  not  neces¬ 
sary.  The  complete  equations  of  motion  are 


Notice  these  equations  are  first-order  decoupled  -  a  charac¬ 
teristic  that  is  necessary  to  initiate  the  development  of  the 
optimal  control  problem  in  the  next  section. 

OPTIMAL  TRAJECTORY  GENERATION 

If  the  image  space  rigid  body  equations  of  motion  are  viewed 
as  a  nonlinear  state  space  system  we  can  cast  the  dynamic 
interpolation  problem  into  the  determination  of  an  optimal 
trajectory  based  on  optimal  control  theory.  In  this  paper 
the  optimal  rigid  body  trajectory  is  the  one  that  minimizes 
the  total  force  and  torque  applied  to  the  body,  as  well  as 
their  derivatives,  over  a  certain  time  period.  Other  perfor¬ 
mance  indices  could  be  chosen,  such  as  power  or  total  energy 
minimization.  However,  we  restrict  our  analysis  to  applied 
force  and  torque  minimization  in  the  following  development. 
The  equations  of  motion  depicted  in  Equations  1  and  2  can 
be  written  as 

z=:a(z,u)  (38) 

where,  according  to  Equation  37 


H-‘s 


The  performance  measure  we  wish  to  minimize  is 


(u)=  f  ?(u) 


Za  =  H-^g 


ff(u)  =  -u^u.  (41) 

It  is  convenient  to  use  the  function  referred  to  as  the 
Hamiltonian,  which  is  defined  as  (see,  for  example,  Bryson 
and  Ho  (1975)  and  Kirk  (1970)) 

H  =  ^(u)  -i-  A^a  (42) 

where  A  is  a  vector  of  Lagrange  multipliers.  From  this  we 
can  state  the  necessary  conditions  to  determine  an  optimal 
trajectory  as 

(which  is  equivalent  to  Equation  38) 

A  =  -^  (44) 

and 

Equations  43,  44  and  45  comprise  a  set  of  thirty  two  bound¬ 
ary  value  equations,  which  when  solved,  give  us  the  opti¬ 
mal  state  trajectories  of  the  body.  These  states  translate  to 
mass  center  position  and  velocity,  and  orientation  and  rate 


178 


of  change  of  orientation  trajectories  for  the  body  between 
the  given  endpoints.  Naturally,  these  trajectories  satisfy  the 
boundary  conditions  imposed  by  the  analyst.  However,  it 
must  be  emphasized  that  Equations  43,  44  and  45  do  not 
require  the  force  and  torque  components  to  satisfy  any  spe¬ 
cific  boundary  conditions-  Hence,  Equation  38  should  be 
modified  to  make  the  necessary  force  and  moment  accom¬ 
modations.  We  can  append  the  relation 

li  =  w  (46) 

to  Equation  38  to  obtain 

z  =  a(z,w)  (47) 


where  the  state  vector  z  is  now  a  22-element  vector  contain¬ 
ing  the  original  sixteen  states  along  with  the  components  of 
force  and  torque.  With  this  adjustment  the  forces  and  mo¬ 
ments  are  now  treated  cis  states,  and  w,  the  time  derivative 
of  the  applied  force  vector,  becomes  the  control  vector.  The 
vector  a  is  rewritten  as 


a(z,w) 


(48) 


We  also  alter  g  to  include  the  new  control  vector: 


^(u,  w)  =  2  +  w^w) 


(49) 


Equation  44  remains  the  same  as  given,  keeping  in  mind 
that  z  and  H  have  changed.  Equation  45  becomes 


0  = 


dU 

dv/' 


and 

Zc  =  vv.  (58) 

The  partial  derivative  given  in  Equation  43  can  now  be  sep¬ 
arated,  taking  advantage  of  the  differing  composition  of  the 
sub-components  of  z.  The  first  portion  is  comprised  of  the 
first  eight  elements  of  A,  and  is  written  as 


(59) 


or 


:  ^  \  T  .  T  ^26 


dz'S 


(60) 


Upon  substitution  of  expressions  from  Equations  56,  57  and 
58  the  last  equation  becomes 


rg(H-^g) 
^  “  dzl 


o_T  “r Ci_T)' 


dzl 


dzl 


(61) 


H  and  w  are  not  functions  of  the  first  eight  elements  of  z, 
which  comprise  Zo,  so  we  simplify  Equation  61  into 


Ar=-(ArH-'^+A?’). 


(62) 


The  second  partial  derivative  portion  is 


(63) 


Tdjn-^g)  ^  T  dZa  ,  ^  T  ^  W 


dzl 


Here,  we  can  use  some  matrix  manipulation  and  the  kro- 
necker  product  ‘‘0”  to  rewrite  Equation  64,  devoid  of  the 
partial  derivative  of  H'  ''\.as 


The  modified  set  of  boundary  value  equations  given  by 
Equations  47,  44  and  50  allow  the  additional  specification  of 
the  force  state  at  each  endpoint,  and  the  resulting  force  tra¬ 
jectory  must  satisfy  these  conditions.  The  modification  of 
g  to  include  the  time  derivative  of  the  u  will  help  minimize 
sharp  changes  in  the  applied  forces  to  the  body.  We  continue 
by  further  analyzing  the  form  of  the  co-state  equations. 

For  notational  ease  define  the  Lagrange  multiplier  vectors 


Aa  =  [  Ai  A2  . 

As  ]^ 

(51) 

Ab  =  [  A9  Alo  . 

Ais 

(52) 

and 

Ac  =  [  Ai7  Ai8  . 

A.2 

(53) 

thus, 

A  =  [  Aa  1  At  1 
Next,  we  partition  z  similarly  as 

Aa]". 

(54) 

Z  =  [  Za  1  26  1 

Za]" 

(55) 

thus, 

Za  =  H-‘g, 

(56) 

26  ”  Zci 

(57) 

A^  =  -A^(-H-'  ^(I»  ®  H-')(l8  ®  g)  + 
H-‘^). 


(65) 


Finally,  since  performance  measure  ^  is  a  function  of  u  we 
must  write 

.  T  _  dg  T 

dz? 


or 


(66) 


Ac  —  Q2.T  azT  ^ 

A? 


az 

xT  az 


_ ,  \Taw\ 

^  izf  azY 


(67) 


Recall  that  zf  is  actually  the  applied  force  vector  u  so  Equa¬ 
tion  67  is  written  as 


Ac  -  AcJn  2 


(68) 


which  becomes 

Ar  =  -ArH-‘^-u^.  (69) 

Equations  62,  65  and  69,  often  referred  to  as  the  co- state 
or  adfoini  equations  (1975;  1970),  complete  the  set  of  forty 
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and  at  the  final  state 


Figure  2:  Initial  and  Final  Positions 


four  boundary  value  equations.  However,  notice  that  the 
control  vector  w  appears  explicitly  in  vector  a  in  the  state 
equations.  Since  this  vector  is  unknown  prior  to  the  inte¬ 
gration  of  the  boundary  equations,  it  must  be  replaced  with 
components  of  z  and  A.  The  condition  stated  in  Equation  45 
provides  the  relationship  to  enable  this  substitution.  Equa¬ 
tion  45  can  be  expanded  to  give 


This  becomes 


or 


„  3ff(u,w)  ,  az 

(70) 

„  1  5w^w  ,  ,  T  9-w 

“  ~  2  aw^  '  ' aw^ 

(71) 

1 

II 

(72) 

Equation  72  is  the  explicit  expression  for  the  control  vector 
enabling  elimination  of  w  from  the  state  equations  and  co¬ 
state  equations.  We  illustrate  these  ideas  with  the  following 
example. 


EXAMPLE 


The  ideas  developed  in  this  paper  are  demonstrated  here 
through  numerical  integration  of  the  boundary  value  equa¬ 
tions  just  derived.  We  will  perform  dynamic  interpolation 
of  a  rigid  body  motion  between  an  initial  and  final  state. 
The  information  comprising  these  states  are  the  image  space 
velocity  Za,  the  image  space  position  Zb  a,nd  the  vector  of 
applied  forces  and  torques  u.  For  the  initial  state  we  have 


"  0.0  ‘ 

■  0.0  ■ 

0.0 

0.0 

'  0.0  ■ 

0.0 

0.0 

0.0 

0.0 

1.0 

0.0 

0.0 

Zb  = 

0.25 

u  = 

0.0 

0.0 

0.25 

0.0 

0.0 

0.25 

0-0 

-  0.0  _ 

.  0.0  . 

(73) 


-  0.37  ■ 

*  0.55  ’ 

0.37 

0.55 

'  3.1  ' 

0.38 

0.55 

3.1 

-2.0 

0.30 

3.0 

-1.9 

Zb  = 

0.48 

u  = 

3.2 

-1.9 

0.47 

3.1 

-1.9 

0.47 

3.2 

.  -8.3  . 

_  -2.5  . 

The  initial  and  final  positions  of  the  rigid  body  appear  as 
shown  in  Figure  2. 

The  elapsed  time  during  the  motion  interpolation  is  a 
user-defined  parameter.  For  thb  example  we  shall  choose 
At  =  2.0  seconds.  The  resulting  graphical  representation 
of  the  dynamically  interpolated  motion  is  illustrated  in  Fig¬ 
ure  3. 


Figure  3:  Dynamically  Interpolated  Motion 


CONCLUSION 

We  have  proposed  and  demonstrated  a  method  for  dynam¬ 
ically  interpolating  the  spatial  motion  of  a  rigid  body.  The 
method  relies  on  generating  the  equations  of  motion  for  a 
rigid  body  using  image  space  kinematic  entities.  The  bound¬ 
ary  value  equations  that  result  from  the  trajectory  optimiza¬ 
tion  yield  a  rigid  body  motion  that  is  dynamically  consistent 
between  the  given  terminal  states. 
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